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(57) Abstract 

The present invention discloses novel proteins which are dimers and multimers of single chain antibodies. The single chain 
antibodies contain polypeptides of two domains derived from the variable domains of antibodies which are joined by a peptide 
linker. The dimers and multimers are formed by non-covalent linking of the single chain polypeptides. Uses include all those ap- 
propriate for monclonal and polyclonal antibodies and fragments thereof, including use as a bispeciHc antigen-binding molecule. 
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DIMER AND MULTIMER FORMS OF SINGLE CHAIN POLYPEPTIDES 

This invention relates to the field of molecular biology and production of single 
chain polypeptides. 

5 The basic structure of an antibody is a tetramer composed of two identical 

heterodimers each consisting of a light and a heavy chain. The antibody molecule is divided 
into variable (V) domains that are responsible for binding specificity and constant (C) domains 
that carry out various effector functions. The V domain is constructed of one V homology unit 
from both the light and heavy chains, designated Vl and Vh respectively. The variable region 

1 0 can be further subdivided into framework regions (FR) and complementarity determining 
regions (CDR), also designated as hypervariable regions. The FR maintain the structural 
integrity of the variable region domain. The CDR are the polypeptide segments within the 
variable region that mediate binding to the antigen. As the antibody contains two 
heterodimers, it has two antigen binding sites (divalent). 

-1 5 By using current techniques in molecular biology, it is possible to clone 

polypeptide chains. The more complex the molecules, e.g., the multichain immune receptors of 
the Ig superfamily, the more difficult they are to produce in a single foreign host. Genes coding 
for each chain of a complex molecule can be cloned and expressed in separate hosts but the 
aggregation and refolding of the resultant polypeptides into a biologically artive entity is 

20 difficult to achieve. Multiple chains expressed by multiple genes within the same host have an 
advantage in aggregating and refolding into native structure but their expression in 
stoichiometric amounts is difficult to regulate. Thus, neither approach has proven to be 
efficient. 

Since it is the variable regions of light and heavy chain antibodies that interact 
25 with an antigen, single chain polypeptides have been created with one and one Vh joined by 
a peptide linker (U.S. Patent 4,946,778). This single chain fragment (Fv) is univalent and is one 
of the smallest structures necessary for antigen binding activity having all 6 CDR regions of a 
Vl-Vh combination. 

The use of the functional smaller polypeptides which maintain an antigen 
30 recognition site are also advantageous when used for therapeutic purposes in mammals in that 
the smaller molecules are capable of rapidly localizing in the target tissue, such as single chain 
antibodies localizing in cancerous tissue. However, while being able to penetrate the desired 
tissue, polypeptides with a molecular weight of less than about 50.000 daltons have the 
disadvantage of being retained within the glomerulusof the kidney This is particularly 
35 disadvantageous when the polypeptide is bound with a carrier, such as a radioisotope or toxin, 
as the isotope or toxin will also accumulate in the kidneys. 

Single chain polypeptides also suffer from the disadvantage of having only one 
binding site, thereby reducing their avidity. 

-1- 
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It would therefore be advantageous to obtain constructions of the single-chain 
antibodies which retain their antigen or ligand recognition proF>erties, are of sufficient 
molecular weight to enhance retention at the target site with reduced retention in the kidney, 
and have a multiplicity of binding sitestoenhancetheavidity of the polypeptide. In addition, 

5 it would be beneficial if the single chain immunoglobulin could be rendered bispecific to allow 
for recognition of different epitopes on the target tissue (or lymphocyte) or to allow for 
antibody-based recruitment of other immune effector functions (i.e., complement proteins, 
cytotoxic lymphocytes, etc.). 

Surprisingly it has been found that single chain antibodies will non-covalently 

10 associate to form dimers and multimers. In one embodiment, the invention is a protein 
comprising a dimer or multimer of single chain polypeptides wherein the polypeptides are 
non-covalently linked and each polypeptide comprises (a) a first polypeptide comprising an 
antigen binding portion of a variable domain from an immunoglobulin; (b) a second 
polypeptide comprising an antigen binding portion of a variable domain from an 

1 5 immunoglobulin; and (c) a peptide linker linking the first and second polypeptides (a) and (b) 
into a single chain polypeptide having affinity for an antigen. 

In another embodiment, the invention isa protein comprising a dimer or 
multimer of single-chain polypeptides wherein the polypeptides are non-covalently linked and 
each polypeptide comprises (a) a first polypeptide comprising an antigen binding portion of a 

20 variable domain from an immunoglobulin; and (b) a second polypeptide comprising an 

antigen binding portion of a variable domain of an immunoglobulin wherein polypeptides (a) 
and (b) are joined by a peptide bond. 

In another aspect the present invention includes the aforementioned dimer or 
multimers conjugated to an imaging marker or therapeutic agent. The invention also includes 

25 a composition comprising the dimer or multimer conjugated to an imaging marker or 
therapeutic agent in a pharmaceutically acceptable carrier. 

The invention is also directed to a method for in vitro or in vivo diagnostics of 
cancer which comprises administering to an animal containing a tumor expressing a 
recognizable marker, a pharmaceutically effective amount of a composition containing a dimer 

30 or multimer of single chain polypeptides wherein the first and second polypeptide are specific 
for the recognizable marker. 

In another embodiment, the invention is a method for intraoperative therapy 
which comprises (a) administering a pharmaceutically effeaive amount of the aforementioned 
composition containing di- or multimers of a single chain antibody against a recognizable 

35 marker to a patient containing a tumor expressing this recognizable marker, whereby the 
tumor is localized and (b) excising the localized tumor. 

-2- 
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Instill another embodiment, the invention concerns a process f r preparing and 
expressing the aforementioned dimerand multimers of single chain polypeptides. Such an 
exemplary process comprises: 

(i) providing a genetic sequence coding for a single chain polypeptide; 
5 (ii) transforming a host cell with the genetic sequence; 

(iii) expressing the genetic sequence in the host eel I ; and 

(iv) recoveryof single chain polypeptides which have non-covalently linked to 
form dimers and multimers. 

Brief Description of the Drawings 

Figure 1 shows the digestion of pRW83 to produce vectors used in the 

construction of plasmid pSCFV31 . 

Figure 2 shows the construction of plasmid pSCFV31. 

Figures illustrates the nucleotide sequence and amino acid sequence of SVFVI. 

Figures 4 shows the construction of plasmid pSCFV UNIH. 
^ 5 Figure 5 illustrates the nucleotide sequence and amino acid sequence of Hum4 Vl 

gene Cla l-Hind III segment in pRLIOOl . 

Figure 6A illustrates the plasmid pSCFV VHH and plasmid pSCFV UHM. (B) and (C) 
illustrate a schematic representation of a single chain Fv monomer and single chain Fvdimer, 
respectively. The darkened areas represent the complementarity determining regions. 
20 Figure 7 illustrates the nucleotide sequence and amino acid sequence of a Human 

Subgroup 4 variable light chain (H4\/l) and a murine variable having chain single chain Fv 
immunoglobulin fragment. 

Figure 8 illustrates the DNA sequence and amino acid sequence of CC49 SCFV 
species with a pi of 8. 1 (A), 5.8(8) and 5.2(C). 
25 Figure 9 shows the results of a cation exchange chromatograph activity, 

SDS-PAGEand Western Blot of E. co// pSCFV UHM8.1 periplasmic fraction. 

Figure 10 shows the results of an anion exchange chromatograph, SDS-PA6E and 
Western Blot of Pool 2 (Figure 9) of E co// pSCFV UHM8. 1 periplasmic fraction. 

Figure 1 1 shows the results of a gel filtration chromatograph SDS-PAGE and 
30 isoelectric focusing (lEF) from fraction 1 of the anionic exchange (Figure 10). 

Figure 1 2 shows the results of a gel filtration chromatograph, aai vity SDS-PAG E 
and Western Blot of £. coii pSCFV UHM5-2 periplasmic fraction. 

Figure 13 illustrates the results of a CC49 antibody competition based on moles of 
antibody binding sites. 

35 Figure 14 illustrates the results of a competition EUSA read one hour after 

addition of substrate. 

Figure 15 shows the construction of plasmids pP41, pP42, pP421, pP422, pP431 

and pP432. 

-3- 
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Figure 16 illustrates the DNA sequence and amino acid sequence of CC49 SCFV in 

pPY21(A) and pPY22(B). 

Figure 17 shows the relative activity of CC49 SCFV species obtained fr m 

Pichia pP431 and pP432 clones. 
5 Figure 18 shows the results of gel filtration and SDS-PAGE analysis of Pichia CC49 

scFv species haying a pi of 5.2. 

Figure 19 shows the results of an anion exchange chromatography and SDS-PAGE 
of Pichia CC49 scFv species having a pi of 5.2. 

Figure 20 shows the results of an isoelectric focusing and Western Blot of P/c/)/a 
10 scFv species having a pi of 5.2. 

Figure 21 illustrates the results of a competition ELISA based on (A) moles of 
antibody sites and (B) moles of antibody. 

Figure 22 shows the construaion of plasmids pATDFLAG and pSC49FLAG. 
Figure 23 illustrates the nucleotide sequences and amino acid sequences in 
1 5 plasmid pATDOFLAG. 

Figure 24 illustrates the nucleotide sequences and amino acid sequences in 

plasmid pSC49FLAG. 

Figure 25 shows the results of an M2 affinity chromatograph (A), analytical gel 
filtration (B), SDS-PAGE (C) and Western Blot of H4L 49HF scFv's. 
2Q Figure 26 illustrates the results of binding of H4L49HF single chain species by 

TAG ELISA (A, B and C) or BSM ELISA (D). 

Figure 27 illustrates the results of a competition assay based on moles of antibody 
binding sites of H4L49F single chain species. 

Figure 28 illustrates the results of a competition assay based on moles of H4L49F 
25 single chain species. 

Nucleic acids, amino acids, peptides, protective groups, active groups and such, 
when abbreviated, are abbreviated according to the lUPAC lUB (Commission on Biological 
Nomenclature) orthe praaice in the fields concerned. 

As used herein, the term "non-covaiently linked" means a polypeptide that will 
3Q be separated on the basis of iu molecular weight by size exclusion chromatography when done 
under non-reducing or reducing conditions, but will dissociate into two or more polypeptides 
when subjected to denaturing conditions such as heating and sodium dodecyl sulfate. 

The term "single chain polypeptide" as used herein means a polypeptide from 
two original peptides linked by a peptide linker where the two original peptides are domains 
35 from an antibody and is capable of binding an antigenic determinant or receptor. "Domain" is 
a segment of protein that assumes a discrete funaion, such as antigen binding or antigen 
recognition. 
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A''dimer" is two single chain polypeptides which are non-coval ntly linked and 
the term "multimer" means an even numberofdimers which are non-c valently linked. 
"Vector" means a DNA element used as a vehicle for cloning a fragment of foreign DNA. 

The single chain polypeptides which form dimers and multimers in the present 

5 invention can be obtained from any member immunoglobulin (Ig) family, including IgG, IgM, 
lgE# IgDand IgA. The two peptides in the single chain are generally selected from a light and 
heavy chain variable region of an antibody. Alternatively, it may be desirable to produce 
dimers and multimers of single polypeptides where the two original peptides are derived from 
the same domain (e.g., VL-linker-Vt or VH-linker-VH). The present invention also includes the 

1Q formation of dimers and multimers wherein one single chain polypeptide is directed against 
one antigen and the second or additional polypeptides are directed against a different 
antigen. 

The dimers and multimers of the present invention formed by domains of the 
antibody are advantageous for therapeutic purposes in mammals as they retain their antigenic 

1 5 recognition properties, contain multiple binding sites and are of larger molecular weight than 
the single chain monomers, thereby reducing their retention in the kidneys. 
Isolation of an Appropriate DNA Sequence 

To prepare a vector containing the DNA sequence for an antigen binding portion 
of an Igsuperfamily protein, a source of the genes encoding for these regions will be required. 

20 The appropriate DNA sequence can be obtained from published sources or can be obtained by 
standard procedures known in the art. For example, Kabat et al.. Sequences of Proteins of 
Immunological Interest 4th ed. ,(1991), published by The U.S. Department of Health and 
Human Services, discloses sequences of most of the Ab variable regions which have been 
described to date, WO 90/044 1 0 published May 3, 1 990 discloses a DNA sequence for variable 

25 regions of antibodies against tumor associated antigen (TAG). 

When the genetic sequence is unknown, it is generally possible to utilize cDNA 
sequences obtained from mRNA by reverse transcriptase mediated synthesis as a source of DNA 
to clone into a vector. For antibodies, the source of mRNA can be obtained from a wide range 
of hybridomas. See, for example, the catalogue ATCC Cell Lines and Hybridomas, American 

30 Type Culture Collection, 20309 Parklawn Drive, Rockville Md., USA (1990). Hybridomas 

secreting monoclonal antibodies reactive with a wide variety of antigens are listed therein, are 
available from the collection, and usable in the present invention. These cell lines and others of 
similar nature can be utilized as a source of mRNA coding for the variable region or to obtain 
protein to determine amino acid sequence from the monoclonal antibody itself. The specificity 

35 ofthe antibody to be engineered will be determined by the original seleaion process. The class 
of antibody can be determined by criteria known to those skilled in the art. 

Variable regions of antibodies can also be derived by immunizing an appropriate 
vertebrate, normally a domestic animal, and most conveniently a mouse. The immunogen will 
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be the antigen of interest, or where a hapten, an antigenic conjugate of the hapten to an 
antigen such as keyhole limpet hemocyanin (KLH). The immunization may be carried ut 
conventionally with one or more repeated injections of the immunogen into the host mammal, 
normally at two to three week intervals. Usually three days after the last challenge, the spleen 

5 is removed and dissociated into single eel Is to be used for cell fusion to provide hybridomas 
from which mRNA can readily be obtained by standard procedures known in the art. 

From cultured lymphocytes or hybridomas. mRNA can be obtained which in turn 
is utilized to produce cDNA. From the pool of cDNA, the desired cDNA can be amplified using 
such techniques as the polymerase chain reaction (PGR) or PGR with splicing by overlap 

10 extension (PCR-SOE). PGR in essence involves exponentially amplifying DNA in vitro using 

sequence specified 5' and 3' oligonucleotides. PGR is described in Mullins etal.. Meth. Enz., 155. 
335-350(1987); and PCR Technofogy, Eriich (ed.) (1989). 

When an antibody of interest is obtained, and only its amino acid sequence is 
known, it is possible to reverse translate the sequence. 

^ 5 To form the single chain polypeptides which form dimers and multimers it is 

generally necessary to have a suitable peptide linker which links the first and second 
polypeptides. Suitable linkers for joining the polypeptides are those which allow the separate 
polypeptide chains to fold into a single polypeptide chain which will have a three dimensional 
strurture very similar to the original struaure made of two polypeptide chains, and allows 

20 non-covalent linkage of single chains to form dimers and multimers of the single polypeptide 
chain. The linker must thus be (1 ) sufficiently large to span the distance between the two 
polypeptides; (2) sufficiently flexible to allow the association of the two polypeptides; and 
(3) relatively hydrophilic. being on the water-accessible surface of the molecule. Obtaining a 
three dimensional structure similar to the original antibody molecule allows the resulting 

25 single chain polypeptide, dimer or multimer to recognize and react with the initial antigen. 
Generally, a linker which allows separate polypeptide chains to fold into a single polypeptide 
chain with a three dimensional struaure similar to the original structure made of two 
polypeptide chains should also allow formation of dimer and multimers. Linkers having the 
desired properties can be obtained by the method disclosed in U.S. Patent 4,946,778, the 

30 disclosure of which is hereby incorporated by reference. From the polypeptide sequences 
generated by the methods described in the 4,946,778, genetic sequences coding for the 
polypeptide can be obtained. 

It is also necessary that the linker peptide be attached to the peptides such that 
the binding of the linker to the individual polypeptides does not interfere with the binding 

35 capacity of the antigen recognition site. 

A preferred linker is the helical linker designated 205 as disclosed in 
Pantolianoetal. B/oc/iem., 30, 10117-10125(1991). 

-6- 
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Another preferred linker is 205C, which has the amino acid sequence of Leu Ser 
Ala Asp Asp Ala Lys Lys Asp Ala Ala Ala Lys Lys Asp Asp Ala Lys Lys Asp Asp Ala Lys Lys Asp Leu. 

The linker is generally about 10 to about 50 amino acid residues. Preferably the 
linker is about 10 to about 30 amino acid residues. More preferably the linker is about 12 to 

5 about 30 amino acid residues. Most preferred is a linker of about 15 to about 25 amino acid 
residues. > 

In another embodiment, the invention comprises two variable domains joined by 
a peptide bond without an intervening linker polypeptide. Examples include Vl-Vl, Vh-Vh, 
Vl-Vh and Vh-Vl single chain polypeptidefs. Such construas would maximize inter-polypeptide 

10 chain interactions to form multimers while minimizing the potential for intra-polypeptide 
chain interactions to form monomers. 

Expression vehicles for production of the molecules of the invention include 
plasmids or other vectors. In general, such vectors contain replicon and control sequences 
which are derived from species compatible with a host cell. The vector ordinarily carries a 

15 replicon site, as well as specific genes which are capable of providing phenotypic selection in 
transformed cells. For example, E. coli is readily transformed using pBR322 [Bolivar et al., Gene, 
2, 95- (1977), or Sambrook et al., Mo/ecu/ar Cloning, Cold Spring Harbor Press. New York, 2nd 
Ed. (1989)1, a plasmid derived from an E. coli species which contains genes for ampicillin and 
tetracycline resistance, and thus provides an easy means for identifying transformed cells. 

20 Expression vectors compatible with procaryotic cells are well known in the art and are available 
from commercial sources, e.g., pBR325 from Gibco Biological Research Laboratories. Typical of 
vertor plasmids suitable for procaryotic cells are pUC8, pUC9, pBR322 and PBR329 available 
from Biorad Laboratories (Richmond, CA) and pPL and pKK222 available from Pharmacia-LKB 
(Piscataway, NJ), and the Bluesript SK/KS vectors from Stratagene (LaJolla, CA). 

25 Plasmids suitable for eukaryotic microbes may also be used. S. cerevisiae, or 

common baker's yeast, is the most commonly used among eukaryotic microorganisms although 
a number of other strains, such as Pichia pastoris, are available. Cultures of cells derived from 
multicellular organisms such as Sp2/0 or Chinese Hamster Ovary (CHO), which are available 
from the ATCC, may also be used as hosts. Typical of vector plasmids suitable for eukaryotic 

30 cells are p5V2neo and pSV2gpt (ATCC); pSVL and pK5V-10 (Pharmacia), pBPV-l/pML2d 
(International Biotechnology, Inc.). 

The use of viral expression vectors to express the genes for polypeptides of the 
present invention is also contemplated. As used herein, the term "viral expression vector" 
refers to a DMA molecule that includes a promoter sequence derived from the long terminal 

35 repeat (LTR) region of a viral genome. Exemplary phage include A phage and fd phage (See, 
Sambrook et aL, supra; McCaf^erty et al.. Nature, 6301 , 552-554 (1990). 

It is preferred that the expression vectors and the inserts which code for the single 
chain polypeptides have compatible restrirtion sites at the insertion junctions and that those 
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restriction sites are unique to the areas of insertion. Both vector and insert are treated with 
restriction endonudeases and then ligated by any of a variety of methods such as those 
described in Sambrook et al., supra. 

Preferred genetic constructions of vectors for production of dinners and multimers 

5 of the present invention are those which contain a constitutively active transcriptional 

promoter, a region encoding signal peptide which will direct synthesis/secretion of the nascent 
single chain polypeptide out of the cytoplasm into either an intracellular compartment (in 
eukaryotes) or the cell exterior (or periplasm in prokaryotes). Preferably, the expression rate is 
commensurate with the transport, folding and assembly steps to avoid accumulation of the 

10 polypeptide as insoluble material. In addition to the replicon and control sequences, 

additional elements may also be needed for optimal synthesis of single chain polypeptide. 
These elements may include splice signals, as well as transcription promoter, enhancers, and 
termination signals. 

Vectors which are commercially available can easily be altered to meet the above 

1 5 criteria for a veaor. Such alterations are easily performed by those of ordinary skill in the art i n 
light of the available literature and the teachings herein. 

Additionally, it is preferred that the cloning vector contain a selectable marker, 
such as a drug resistance marker or other marker which causes expression of a selectable trait 
by the host microorganism. Such a drug resistance or other selectable marker is intended in 

20 part to facilitate in the selection of transformants. Additionally, the presence of a selectable 
marker, such as a drug resistance marker, may be of use in keeping contaminating 
microorganisms from multiplying in the culture medium. In this embodiment, such a pure 
culture of the transformed host microorganisms would be obtained by culturing the 
microorganisms under conditions which require the induced phenotype for survival. 

25 Recovery and purification of the dimers and multimers of the present invention 

can be accomplished using standard techniques known in the art. For example, if the 
polypeptides are excreted into the culture medium, the polypeptides can be concentrated by 
ultrafiltration and then separated from monomers by gel filtration. When the polypeptides are 
transported to theperiplasmic space of a host cell, purification can be accomplished by 

30 osmotically shocking the cells, and proceeding with ultrafiltration, antigen affinity column 
chromatography or column chromatography using an ion exchange chromatography and gel 
filtration. Polypeptides which are insoluble and present as refractile bodies, also called 
inclusion bodies, can be purified by lysis of the cells, repeated centrifugation and washing to 
isolate the inclusion bodies, solubilization, such as with guanidine-HCI, and then passage of the 

35 solubilized material through a gel filtration column. 

While not wishing to be bound by theory, it is believed the dimers and multimers 
of the present invention form as a result of two discrete physicochemical states (isoforms) of 
the single chain polypeptide. In one transition state (A) or folding intermediate, only 
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monomers will be formed. In the s cond transition slat (B) only dimers, or multiples thereof 
will be formed. The transition states are believed to be related to a conformational or charge 
equilibrium at a single residue or region in either polypeptide of the single chain polypeptide 
or even in the linker joining the two. During synthesis, since the single chain polypeptide has 

5 not yet assumed a final stable three dimensional structure, the single chain polypeptide is 
_ thought to be fairly random jn configuration. Single chain polypeptides in this later transition 
state are capable of forming interchain dimers, a dimer being capable of non-covalently linking 
to another d i mer or multi mer. 

The activity of the single chain polypeptides can be measured by standard assays 

10 known in the art, for example competition assays, enzyme-linked immunosorbant assay 
(ELISA), and radioimmunoassay (R1A). 
USES 

Thedimersand multimersof the present invention provide unique benefits for 
use in diagnostic and therapeutics, such as for use in diagnosis or therapy of a variety of 

1 5 diseases. In particular, the dimers and multimers are useful for, but not limited to, in vivo and 
in vitro uses in diagnostics, therapy, imaging and biosensors. 

Essentially all of the uses for which monoclonal or polyclonal antibodies, or 
fragments thereof, have been envisioned by the prior art, can be addressed by the multivalent 
proteins of the present invention. These uses include detectably-labelled forms of the 

20 rnultivalent protein. Types of labels are well-known to those of ordinary skill in the art. They 
include radiolabelling, chemiluminescent labeling, fluorochromic labelling, and chromophoric 
labeling. Other uses include imaging the internal structure of an animal (including a human) 
by administering an effective amount of a labelled form of the multivalent protein and 
measuring detectable radiation associated with the animaL They also include improved 

25 immunoassays, including sandwich immunoassay, competitive immunoassay, and other 
immunoassays wherein the labelled antibody can be replaced by the multivalent 
antigen-binding protein of this invention. 

The dimers and multimers may be incorporated into a pharmaceutically 
acceptable carrier. Injectable compositions of the present invention may be either in 

30 suspension or solution form. In solution form the complex (or when desired the separate 
components) is dissolved in a pharmaceutically acceptable carrier. Such carriers comprise a 
suitable solvent, preservatives such as benzyl alcohol, if needed, and buffers. Useful solvents 
include, for example, water, aqueous alcohols, glycols, and phosphonate or carbonate esters. 
Such aqueous solutions generally contain no more than SO percent of the organic solvent by 

35 volume. 

Dimers and multimers composed of antigen specific antibody domains are 
particularly advantageous for use in the diagnosis and/or therapy of diseases, such as cancer, 
where target antigens are often expressed on the surface of cells. For diagnostic and/or 
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therapeutic uses, thedimers or multimers can be conjugated with an appropriate imaging or 
therapeutic agent. Examples of antibodies from which the Vl and Vh domains can be obtained 
to produce dimers or multimers are CC antibodies, such as CC49, disclosed in published PCX 
Application WO 89/00692 on January 26, 1 989 and published PCT Application WO 90/0441 0 on 
5 May 3, 1990. 

Methods for preparing and administering conjugates of the dimers and 
multimers are accomplished by methods well known or readily determined, as described, for 
example, in Goldenberg et al.. New England], Med., 298, 1384-1388 (1978); 
Goldenberg et al. .Gastroentero/., 84, 524-532 (1983); Siccardi et al.. Cancer Res., 46, 4817-4822 

10 (1985); Keenan et al., J, NucL Med,, 25, 1 197-1203 (1984); Meares et al.. Ana!, Biochem., 142, 
68-78 (1984). Moreover, suitable dosages will depend on the age and weight of the patient 
and the therapeutic agent employed and are well known or readily determined. 

Conjugates of the dimers and multimers and an imaging marker are administered 
in a pharmaceutically effeaive amount for the in vivo diagnostic assays, and then detecting the 

15 presence of the imaging marker by appropriate detection means. Generally, the dosage should 
be effective to visualize or detect tumor sites, distinct from normal tissues. Preferably, a one- 
time dosage will be between 0.1 mg to 200 mg of the conjugate of the dimer or multimer and 
imaging marker per patient. 

Examples of imaging markers which can be conj ugated to the dimers and 

20 multimers are well known and include substances which can be detected by diagnostic imaging 
using a gamma scanner or hand held gamma probe, and substances which can be detected by 
nuclear magnetic resonance imaging using a nuclear magnetic resonance spectrometer 

Suitable but not limiting examples of substances which can be detected using a 
gamma scanner include 12S|^ I3i|^ 123|, inin, ^OSRh, i53Sm, 67Cu,67Ga. 166ho, 177Lu, i86Re, i88Re 

25 and 99mTc. An example of a substance which can be detected using a nuclear magnetic 
resonance spectrometer is gadolinium. 

A pharmaceutically effective amount of a composition containing dimers and 
multimers of antibodies is that which should be sufficient to achieve effective binding with the 
antigens against which the antibodies have specific affinity. 

20 Examples of antibody-therapeutic agent conjugates which can be used in therapy 

include dimers or multimers of antibodies coupled to radionuclides, such as 1 3i |^ 90y, iosrh, 
^7Sc. 67Cu, 2i2Bi, 211 At. 67Ga, 125|, 186R€, i88Re, i77lu, 99mTc, i53Sm, 123| and ^In; to drugs such 
as methotrexate, adriamycin; to biological response modifiers, such as interferon and to toxins, 
such as ricin. 

25 Methods of preparing and administering conjugates of the dimers and multimers 

of antibodies and a therapeutic agent are well known or readily determined. The 
pharmaceutical composition may be administered in a single dose or multipl dosage form. 
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Moreover, suitable dosages will depend on the age and weight of the patient and the 
therapeutic agent ennployed and are well known or readily determined. 

Dimers and nnultimers containing single chain antibodies are particularly suitable 
for radioimmunoguided surgery (RIGS). In RIGS, an antibody labeled with an imaging marker is 
injected into a patient having a tumor that expresses a target antigen. The antibody localizes 
to the tumor and is detected by a hand-held gamma detecting probe (GDP). The tumor is then 
excised (see. Martin et al. Amer, J. Surg., 156, 38G-392 (1988); Martin et al., Hybhdoma, 5, 
S97-S108). An exemplary GDP is the Neoprobe~ scanner, commercially available from 
Neoprobe Corporation, Columbus, OH. 

The invention will be further clarified by a consideration of the following 
examples, which are intended to be purely exemplary of the present invention. 



15 



20 



25 



30 



35 
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ABBREVIATIONS 



5 



10 



15 



20 



BCIP 


5-bromo-4-chloro-3-indoyl phosphate 


bp 


base pair 


Bis-Tris 


( 1 , 3-bis[t r IS ( hydroxymethyl ) -methylaminoj 


propane 


propane ) 


BSA 


bovine serum albumin 


CDR 


Complementarity determining region 


ELISA 


enzyme linked immunosorbent assay 




non— covalent single chain Fv dimer 




isoelectric focusina 


Kbp 


kilo base pair 


LB 


Lur ia-Ber tani medium 


Mab 


monoclonal antibody 


MES 


2— (N-Morpholino) ethane sulfonic acid 


MW 


molecular weight 


NET 


nitro blue tetrazolium chloride 


PAG 


polyacrylamide gel 


PAGE 


polyacrylamide gel electrophoresis 


PBS 


phosphate buffered saline 


PGR 


polymerase chain reaction 


pSCFV 


plasmid containing DNA sequence coding 
for SCFV 






RIT 


radioixmnunotherapy 


scFv 


single chain Fv immunoglobulin fragment 


SDS 


sodium dodecyl sulfate 


TBS 


Tris-buf f ered saline 


Tris 


( Tr is[hydroxymethyl]aminomethane ) 


TTBS 


Tween-20 wash solution 


Vh 


immunoglobulin heavy chain variable 
domain 


Vl 


inmiunoglobulin light chain variable 
domain 



35 
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Antibodies 

CC49 : A murine monoclonal antibody specific to the human tumor-associated 
glycoprotein 72 (TAG-72) deposited as ATCC No, HB9459. 

CC49 FAB : An antigen binding portion of CC49 consisting of an intact light chain 
5 linked to the N-terminal portion of the heavy chain. 

ChCC49: Chimeric CC49 has the murine constant regions of the CC49 antibody 
replaced with the corresponding human constant regions. 

CC49 scFv(X) : Single chain antibody fragment consisting of two variable domains 
of CC49 antibody joined by a peptide linker. X refers to the theoretical pi of thescFv(e.g.. CC49 
10 scFvS.I). 

CC49 Fv2 : Two CC49 scFv non-covalentiy linked to form a dimer. The number 
after Fv refers to the number of monomer subunits of a given molecule, e.g., CC49 Fv6 refers to 
the hexamer multimers. 

Hum4 Vi or H4VL : Variable domain of the human Subgroup IV kappa light chain. 
5 SCFV UHM(X) : SCFV consisting of a CC49 variable light chain and a CC49 variable 

heavy chain joined by a 25 amino acid linker named UNIHOPE. {X) refers to the theoretical pi of 
the SCFV which changes as amino acids are added to the C terminus, e.g., scFv U HM8. 1 ; scFv 
UHM5.2 and scFv UHM5.2. 

SCFVUHH: SCFV consisting of a Hum4VLand a CC49 variable heavy chain joined 
20 by a 25 amino acid linker named UNIHOPE. 

EXAMPLES 

General Experimental 

Procedures for molecular cloning are as those described in Sambrook et al., 
Molecular Cloning, Cold Spring Harbor Press, New York, 2nd Ed. (1989) and Ausubel et al., 
25 Current Protocols in Molecular Biology, }ohn Wiley and Sons, New York (1992). 

All water used throughout was deionized distilled water 
Oligonucleotide Synthesis and Purification 

All oligonuclotides (oligos) were synthesized on either a Model 380A or a 
Model 391 DNA Synthesizer from Applied Biosystems (Foster City, CA) using standard 
30 cyanoethyl phosphoramidites and synthesis columns. Protecting groups on the produa were 
removed by heating in concentrated ammonium hydroxide at 55**Cfor 6to 15 hours. The 
ammonium hydroxide was removed through evaporation and the crude mixtures were 
resuspended in 30 to 40 of sterile water. After elearophoresis on polyacrylamide-urea gels, 
the oligos were visualized using short wavelength ultraviolet (UV) light. DNA bands were 
35 excised from the gel and eluted into 1 mL of 100 mM Tris-HCi, pH 7.4, 500 mM NaCI, 5 mM EDTA 
over 2 hours at 65°C. Final purification was achieved by applying the DNA to Sep-Pac"" C-18 
columns (Millipore, Bedford. MA) and eluting the bound oligos with 60 percent methanol. The 
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solution volume was reduced to approximately 50 \iL and the DNA concentration was 
determined by measuring the optical density at 260 nm (OD26o)- 
Bacterial Strains 

Both Escherida coli (f. co//) RR1 and f. co/i GM 161 (dam ), containing the plasmid 

5 pRW83, were obtained from Dr. J. O. Lampen (Rutgers University, New Brunswick, NJ) 

E, coli DH1 was purchased from Invitrogen (San Diego, CA) and competent £. co// AG1 was 
purchased from Stratagene (La Jolla, CA). The plasmid pCGSSIS was obtained from 
Collaborative Research, Inc. (Bedford, MA). 
Restriction Enzyme Digests 

All restriction enzyme digests were performed using Bethesda Research 
Laboratories (Gaithersburg, MD), or New England Biolabs, Inc, (Beverly, MA) enzymes and 
buffers following the manufacturer's recommended procedures. Digested products were 
separated by polyacrylamide gel electrophoresis (PAGE). The gels were stained with ethidium 
bromide, the DNA bands were visualized using long wavelength U V light and the DNA bands 

15 were then excised. The gel slices were placed In dialysis tubing (Union Carbide Corp., Chicago) 
containing 5 mM Tris, 2.5 mM acetic acid, 1 mM EDTA, pH 8.0 and eluted using a Max 
Submarine electrophoresis apparatus (Hoefer Scientific Instruments, CA). Sample volumes 
were reduced on a Speed Vac Concentrator (Savant Instruments, Inc., NY). The DNA was 
ethanol precipitated and redissolved in sterile water. 

20 Enzyme Linked Immunosorbent Assay (ELISA) 

TAG-72 antigen, prepared substantially as described by Johnson et al. Can. Res., 
46 , 850-857 (1986), was ac rorbed onto the wells of a polyvinyl chloride 96 well microtiter plate 
(Dynatech Laboratories, Inc., Chantilly, VA) by drying overnight. The plate was blocked with 
1 percent BSA in PBS for 1 hour at 3rC and then washed 3 times with 200 pL of PBS, 

25 0.05 percent Tween-20. 25 pL of test antibodies and 25 pL of biotinylated CC49 (1/20,000 

dilution of a 1 mg/mL solution) were added to the wells and the plate incubated for 30 minutes 
at BI^C. The relative amounts of TAG-72 bound to the plate, biotinylated CC49, streptavidin- 
alkaline phosphatase, and color development times were determined empirically in order not 
to have excess of either antigen or biotinylated CC49, yet have enough signal to detect 

30 competition by scFv. Positive controls were CC49 at 5 pg/mL and CC49 Fab at 1 0 pg/mL. 

Negative controls were 1 percent BSA in PBS and/or concentrated LB. Unbound proteins were 
washed away. 50 pL of a 1:1000 dilution of streptavidin conjugated with alkaline phosphatase 
(Southern Biotechnology Associates, Inc., Birmingham, AL) were added and the plate was 
incubated for 30 minutes at 3 TC. The plate was washed 3 more times. 50 pLof a 

35 para-nitrophenyl-phosphate solution (Kirkegaard & Perry Laboratories, Inc., Gaithersburg, MD) 
were added and the color reaction was allowed to develop for a minimum of 20 minutes. The 
relative amount f scFv binding was measured by optical density scanning at 404-450 nm using 
a micropiate reader (Molecular Devices Corporation, .Manio Park, CA). Binding of the scFv 
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resulted in decreased binding of the biotinylated CC49 with a concomitant decrease in color 
development. 

Amino-Terminal Sequence Analysis 

Amino terminal amino acid sequencing was performed by the Edman 

5 degradation procedure using an Applied Biosystems mode! 470A Protein Sequencer (Foster 
City, CA). Frartions were analyzed with an interface to an Applied Biosysterns Mode! 1 20A 
Analyzer (Hewrick, M.W. et al.). Protein samples (about 1 0 pg each) were prepared by 
removing PBS present in the sample by 2 cycles of water dilution and concentration using a 
Centrican 30 device (Amicon). 

10 SDS-PAGE and Western Blotting 

Samples for SDS-PAGE analysis (20 pL) were prepared by boiling in a non-reducing 
sample preparation buffer-Seprasol I (Integrated Separation Systems (ISS), Natick, MA) for 
5 minutes and loaded on 10-20 percent gradient polyacryiamide Daiichi Minigelsas per the 
manufacturer's directions (ISS). 

1 5 Electrophoresis was conducted using a Mini 2-ge! apparatus (ISS) at 55 mA per gel 

atconstant current for approximately 75 minutes. Gels were stained in Coomassie Brilliant Blue 
R-250 (Bio-Rad, Richmond, CA) for at least 1 hour and destained. Molecular weight standards 
were prestained (Mid Range Kit, Diversified Biotech, Newton Center, MA) and included the 
following proteins: Phcsphorylase b, glutamate dehydrogenase, ovalbumin, lactate 

20 dehydrogenase, carbonic amhydrase, B-laaoglobulin and cytochrome C. The corresponding 
MWs are: 95,500, 55,000, 43,000, 36,000, 29,000, 18,400, and 12,400, respectively. 

When Western analyses were conducted, a duplicate gel was also run. After 
electrophoresis, one of the gels was equilibrated for 15-20 minutes in anode buffer #1 (0.3 M 
Tris-HCI pH 10.4). An Immobilon-P PVDF (polyvinylidene dichlorine) membrane (Millipone, 

25 Bedford, MA) was treated with methanol for 2 seconds, and immersed in water for 2 minutes. 
The membrane was then equilibrated in anode buffer #1 for 3 minutes. A Milliblot-SDE 
apparatus (Miliipore) was utilized to transfer proteins in the gel to the membrane. A drop of 
anode buffer # 1 was placed in the middle of the anode electrode surface. A sheet of Whatman 
3MM filter paper was soaked in anode buffer #1 and smoothly placed on the elearode surface. 

30 Another filter paper soaked in anode buffer #2 (25 mM tris pH 10.4) was placed on top of the 
first one. A sandwich was made by next adding the wetted PVDF membrane, placing the 
equilibrated gel on top of this and finally adding a sheet of filter paper soaked in cathode 
buffer (25mM Tris-HCI, pH 9.4 in 40 mM glycine). The screws for the cathode were firmly 
secured. Transfer was accomplished in 30 minutes u<ino 250 mA constant Current (initial 

35 voltage ranged from 8-20 volts). 

After blotting, the membrane was rinsed briefly in water and placed in a dish 
with 20 mL blocking solution (1 percent bovine serum albumin (BSA) (Sigma, St. Louis, MO) in 
Tris-buffered saline (TBS)). TBS was purchased from Pierce Chemical (Rockford, IL) as a 
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pr weighed powder such that when 500 mL water is added, the mixture gives a 25 mM Tris, 
0.15 M sodium chloride solution at pH 7.6. The membranes were blocked for a minimum of 1 
hour at ambient temperature and then washed 3 times for 5 minut s each using 20 mL 0.5 
percent Tween-20 wash solution (TTBS). To prepare the TTBS. 0.5m L of Tween 20 (Sigma) was 
5 mixed per liter of TBS. The probe antibody used was 20 mLbiotinylated FAID14 solution (10 ^ig 
per 20 mL antibody buffer). Antibody buffer was made by adding 1 g BSA per 100 mL of TTBS. 
After probing for 30-60 minutes at ambient temperature, the membrane was washed 3 times 

with TTBS, as above. 

Next, the membrane was incubated for 30-60 minutes at ambient temperature 

^0 with 20 mL of a 1 :500 dilution in antibody buffer of streptavidin conjugated with alkaline 
phosphatase (Southern Biotechnology Associates. Birmingham. AL). The wash step was again 
repeated after this, as above. Prior to the color reartion, membranes were washed for 2 
minutes in an alkaline carbonate buffer (20 mL). This buffer is 0.1 M sodium bicarbonate, 1 mM 
MgCI2.H20, pH 9.8. To make up the substrate for alkaline phosphatase, nitroblue tetrazolium 

15 (NBT)chloride{50mg, Sigma) was dissplved in 70 percent dimethylformamide. 5-Bromo- 
-4-chloro-3-indoyl phosphate (BCIP) (25 mg, Sigma) was separately dissolved In 100 percent 
dimethylformamide. 5-Bromo-4-chloro-3-indoyl phosphate (BCIP) 25 mg, Sigma) was 
separately dissolved in 100 percent dimethylformamide. These solutions are also commercially 
available as a Western developing agent sold by Promega. These solutions were stored at 4'*C 

20 in the dark for up to 3 months. For color development, 120^Lof each were added to the 

alkaline solution above and allowed to react for 1 5 minutes before they were washed from the 
developed membranes with water. 
Biotinvlated FAID14 

FAID14 is a murine anti-idiotypic antibody (lgG2a, K isotype) deposited as ATCC 

25 No. CRL 10256 directed against CC49. FAID14 was purified using a Nygene Protein A affinity 
column (Yonkers, NY). The manufacturer's protocol was followed, except that 0.1 M sodium 
citrate, pH 3.0 was used as the elution buffer. Fraaions were neutralized to pH -7 using 1 .0 M 
Tris-HCI pH 9.0. The biotinylation reaaion was set up as follows. FAID14(1 mg, lOOjiLin 
water) was mixed with 100 pL of 0.1 M Na2C03 pH 9.6. Biotinyl-e-amino-caproic acid N-hydroxy 

30 succinimide ester (Biotin-X-NHS) (Calbiochem, LaJolla, CA) (2.5 mg) was dissolved in 0,5 mL 
dimethylsulf oxide. Biotin-X-NHS solution (20 pL) was added to the FAID1 4 solution and 
allowed to react at 22*'C for 4 h. Excess biotin and impurities were removed by gel filtration, 
using a Pharmacia Superose 12 HR 10/30 column (Piscataway, NJ). At a flow rate of 0.8 m L^min, 
the biotinylsted FA!D14 emerged with a peak at 16.8 min. The fractions making up this peak 

35 were pooled and stored at 4**C and used to detect the CC49 idiotype as determined by the CC49 
Vl and Vh CDRs. 
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Isoelectric Focusing (lEF) and Western Analysi s 

Isoelectric points (pi's) were predicted using a computer program called 
PROTEIN-TITRATE, available through DNASTAR (Madison, Wl). Based on amino acid 
composition with an input sequence, a MW value is given, in addition to the pi. Since Cys 

5 residues contribute to the charge, the count was adjusted to 0 for Cys, since they are all 
involved in disulfide bonds.. ^ 

Experimentally, pi's were determined using Isogel agarose lEF plates, pH range 
3-10 (FMC Bioproducts, Rockland, ME). ABiorad Bio-phoresis horizontal electrophoresis cell 
was used to run the lEF, following the directions of both manufacturers. The electrophoresis 

^0 conditions were: 500 volts (limiting), at 20 mA current and 1 0 W of constant power. Focusing 
was complete in 90 min. lEF standards were purchased from Biorad; the kit included 
phycocyanin, p-lactoglobulln B, bovine carbonic anhydrase, human carbonic anhydrase, equine 
myoglobin, human hemoglobins A and C, 3 lentil lectins and cytochrome C, with pi values of 
4.65, 5. 1 0, 6.00, 6.50, 7.00, 7. 1 0 and 7.50, 7.80, 8.00, and 8.20 and 9.60 respectively. Gels were 

15 stained and destained according to the directions provided by FMC. 

. For lEF Westerns, duplicate gels were electrophoresed. Focused proteins in the 
second gel were transferred to PVDF membranes, as above, using a Biorad Model 483 slab dryer 
hooked up to a dry ice trap and vacuum pump. A piece of fi Iter paper wetted in water was 
placed on the drier surface, followed by a prepared PVDF membrane, and then by the gel itself, 

20 with its sol id support facing up. The plastic cover of the drier was then placed on top of this 
"sandwich". The drier was operated under vacuum without heat for 20 min to transfer the 
proteins in the gel to the membrane. Following the transfer, the blotted membrane was 
blocked and subsequently probed with biotinylated FAID 14 as described above for SDS-PAGE. 
Quantitation of CC49 Antibody Species 

25 Purified CC49 antibodies were quantitated by measuring the absorbence of 

protein dilutions at 280 mm using matching 1 .0 mL 1 .0 cm pathlength quartz cuvettes (Hellma) 
and a Perkin-Elmer UVA/IS Spertrophotometer, Model 552A. Molar absorptivities (Em) were 
determined for each antibody by using the following formula: 

Em = (numberTrp)X 5,500 + (number Tyr)X 1,340 + 

3Q (number (Cys)2) X 1 50 + (number Phe) X 1 0 

The values are based on information given by D. B. Wetlaufer, Advances in Protein Chemistry, 
17, 375-378). 

High Performance Liquid Chromatography 

All high performance liquid chromatography (HPLC) was performed using ar. 
35 LKB HPLC system with titanium or teflon tubing throughout. The system consists of the 

Model 2150 HPLC pump. Model 2152 controller, UV CORD Sll Model 2238 detection system set 
at an absorbance of 276 nm and the Model 2211 SuperRac fraction collector. 
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EXAMPLE 1 Formati n of Single Chain Antibody Dimers in f. co!i 

A. Preparation of pSCFV 31 

A vector was prepared f r m plasmid pRW 83 containing a chloramph nicol 

resistance (CamO gene for clone selection, and a penP gene with a penPpromoter and 
5 terminator(see Me2esetal-(1983),J. B/o/. Chem.. 258, 1 1211-11218) and the pe/S signal 

sequence {see Lei eta\.,J. Bact. 169, 4379-4983(1987)]. The veaor designated Fragment A (see 

Figure 1) was prepared by removal of the penP gene with a H/nd 1 1 l/Sa/ 1 digest. 

The penP promoter and pe/E signal sequence were obtained by a PCR using 

pRW 83 as a template and oligonucleotides penPI and penP2 as primers. The fragment was 
10 designated Fragment B (see Figure 1). ANco I enzyme restrialon site was introduced at the 

3' end of the signal sequence region by the penP2 oligonucleotide. 

penPI: 

5*-CGATAAGCTTGAATTCCATCACTTCC-3' 
penP2: 

15 5'-GGCCATGGCTGGTTGGGCAGCGAGTAATAACAATC(L^GCG GCT 

G CCGTAG GCAATAGGTATTTCATCAAAATCGTCTCCCTCCGTTTG AA-3' 

A scFv comprised of a Hum4 Vl, a CC49 Vh, and an 1 8 amino acid linker (Lys Glu 

Ser Gly Ser Val Ser Ser Glu Gin Leu Ala Gin Phe Arg Ser Leu Asp) was obtained from 

pCGS515/SCFV1 by PCR using oligonucleotides penP3 and penPS. This fragment was 
20 designated Fragment D (see Figure 2). The complete nucleotide sequence and amino acid 

sequence of pCGSSIS/SCFV 1 is given in Figure 3. A Bd I site was introduced at the 3' end of the 

Vh region by the penP6 oligonucleotide. 

p€nP3: 

5'-GCTGCCCAACCAGCCATGGCCGACATCGTGATGACCCAGTCTCC-3' 
25 penP6(-): 

5'-CTCTTGATCACCAAGTGACTTTATGTAAGATGATGTTTTG ACG 
GATTCATCGCAATGTTTTTATTTGCCGGAGACGGTGACTGAGGTTCC-3' 

Fragments B and D were joined by SOE-PCR (splicing by overlap 
extention-poiymerase chain reaaion) using oligonucleotides penPl and penP6, following the 
30 procedures of Norton et al.. Gene, 77, 61-68 (1 989). The new fragment was designated E (See 
Figure 2). 

Fragment C, containing the penPtermination codon. was isolated by digesting 
pRW83 with Bd I and 5a/ 1. pRW83 was isolated from f. co/i strain GM 161, which is DNA 
methylase minus or dam . 
35 Plasmid pSCFV 3 1 (see Figure 2) was created with a tnree part ligation of 

Fragments A, C, and E. 
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B. Preparation of oSCFV 31b 

The A/CO I restriaion enzyme site within the Camr gene and the Hmd\i\ site 
located at the 5' end of the penP promoter in pSCFV 31 were destroyed through a PGR DNA 
amplification using oligonucleotides Ncol.1 and Ncol.3(-) to generate an £co Rl to A/co I 
5 fragment and oligonucleotides Ncol .2 and Ncol .4c(-) to generate a Nco I to £co Rl fragment. 
These two fragments were joined by PCR-SOE using oligonucleotides Ncol .1 and Ncol .4c(-). 
The oligonucleotides are set forth below: 
Ncol.l: 

5'-TCCG G AATTCCGT ATG G CAATG A-3' 
10 Nco1.3(-): 

5'-CrTGCGTATAATATTTGCCCATCGTGAAAACGGGGGC-3' 
Ncol. 2: 

5'-ATGGGCAAATATTATACGCAAG-3' 
Nco1.4c(-): 

1 5 5*-CACTG AATTCATCGATG ATAAGCTGTCAAACATG AG-3' 

pSCFV 3 1 was digested with Eco Rl and the larger fragment was isolated by 
polyacrylamidegel elearophoresis. To prevent self ligation, the DNA wasdephosphorylated 
using calf intestinal alkaline phosphatase according to the teachings of Sambrook et al., supra. 

A two part ligation of the larger pSCFV 31 digested fragment and the PCR-SOE 
20 fragment, described above, resulted in the creation of pSCFV 31b (see Figure 4). 

C. Preparation of oSCFV 33H 

pSCFV 31b was digested with Nco I and Sal I and a fragment containing the Cam^ 

gene was isolated. ~ 

TheHum4VL was obtained by PCR DNA amplification using pCGSSI 5/SCFV1 (See 
25 Figure 3) as a template and oligonucleotides 104BH1 and 104BH2(-) as primers. 
104BH1: 

5'-cagccatggccgacatcgtgatgacccagtctc<:a-3' 

104BH2(-): 

5'- aag cttg ccccatg ctg cttt aacg tt agtttt atctg ctgg 

30 AGAC^GAGTGCCTTCrrGCCTCCACCTTGGTCCCTCCGCCGAAAG-3' 

The CC49 Vh was obtained by PCR using p49Yl-2.3 (PCT WO 90/0441 0) as a 
template and oligonucleotides 104B3 and 104B4(-) as primers. A Nhe I enzyme restrirtion site 
was introduced just past the termination codon in the 3' end (before the Bd I site) by 
oligonucleotide 104B4(0. 

35 104B3: 

5'-GTTAAAGCAGCATGGGGCAAGCTTATGACTCAGTTGCAGCAGTCTGACGC-3' 
104B4(-): 
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5'-CrCTTGATCACCAAGTGACTTTATGTAAGATGATGTTTTGACGGATT 
CATCGCTAGCTTTTTATTTGCCATAATAAGGGGAGACGGTGACTGAGGTTCC-3' 

In the PCR which joined these two fragments using ligonucleotides 104BH1 and 
104B4(-) as primers, a coding region for a 22 amino acid linker was formed. 
5 A fragment C (same as above) containing the pen? termination codon was 

isolated from pRW 83 digested with Bd I and Saf L 

Plasmid pSCFV 33H (see Figure 4) was created with a three part ligation of the 
veaor, fragment C. and the SCFV fragment described above. 
D. Preparation of pSCFV UNIH 

To create a vector for any SCFV with the desired restriction sites in place, a 
plasmid designated UNIH was created by digesting pSCFV 33H with A/col and A/hel, and 
isolating the DNA fragment containing the Cam^ gene. 

Hum4 Vl was obtained by PCR DNA amplification using pRLIOOl as a template 
and oligonucleotides UNIH 1 and UNIH2(-) as primers. The DNA sequence in pRLIOOl coding for 
15 the Hum4 Vtisgiven in Figures. Oligonucleotides for the PCR were: 
UNIHl: 

5'-CAGCCATGGCC GACATTGTGATGTCACAGTCTCC- 3' 

The Nco I site is in bold and the hybridizing sequence is underlined. 

UNIH2(-): 

20 5'-GAGGTCCGTAAGATCTGCCTCGCTACCTAGCAAA 

AGGTCCTCAAGCTTGATCACCACCTTGGTCCCTCCGC-3' 

The Hind III site is in bold. 

The CC49 Vh was obtained by a PCR using p49y1-2.3 as a template and 

oligonucleotides UNI3 and UNI4(-) as primers. 
25 UNI3: 

5'-AGCGAGGCAGATCTTACGGACCTCGAG GTTCAGTTGCAGCAGTCTGAC -3\ 
The Xho I site is in bold and the hybridizing sequence is underlined. 
UNI4(-): 

5'-CATCGCTAGC TTTTTATGAGGAGACGGTGACTGAGGTTCC >3'. 
30 The A/he I site is in bold and the hybridizing sequence is underlined 

Oligonucleotides UNIHl and UNI4(-) were used in the PCR-SOE amplification 
which joined the Hum4 Vl and CC49 VH fragments and formed a coding region for a negatively 
charged fifteen amino acid linker. The DNA was digested with Nhe I and Wco I and Itgated with 
the vector fragment from the Nco i-Nhe I digest of pSCFV 33H. The resultant plasmid wa^ 
35 designated pSCFV UNIH (see Figure 4). 

With the construction of pSCFV UNIH, a universal veaorforany SCFV was created 
with all the desired restriction enzyme sites in place. 
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E. Preparation of pSCFV UHH and dSCFV UHM 

pSCFV UNIH was digested with HindWWXho I, and the large DNA fragment 
containing the Cam*" gene, Hum4 and CC49 Vh was isolated. 

A fragment coding for a 25 amino acid linker, UNIHOPE, was made by annealing 

5 the two oligonucleotides shown below. Annealing conditions were as follows: lOpgofeach 
oligo were added to Bethesda Research Laboratories (Gaithersburg, MD) enzyme Buffer #2 for 
a total volume of 53 pL. the mixture was heated to 98*'C for one minute, cooled slowly to 40*C 
and placed at4**C overnight. Following digestion with HindWUXho I, the linker was 
electrophoresed through a 6 percent PAG. The DNA bands were excised from the gel and the 

1 0 DNA was electroeluted for five minutes at 200 volts. The DNA was pelleted by ethanol 

precipitation and dissolved in water. The linker UNIHOPE is based on 205C SCA*" linker (see 
Whitlow et al., Methods: A Companion to Methods in Enzymology. g, 97-105 (1991), but the 
first amino acid was changed from serine to leucine and the twenty-fifth amino acid were 
changed from glycine to leucine, to accommodate the Hind\\\ and Xho I restriction sites, 

1 5 respectively. The nucleotide sequence encoding the linker UNIHOPE isset forth below: 
UNIHOPE (Top Strand): 

5'-TATAAAGCTTAGTGCGGACGATGCGAAAAAGGATGCTGCGAAG 
AAGGATGACGCTAAGAAAGACGATGCTAAAAAGGACCTCGAGTCTA-3' 

U NIHOPE(-) Bottom Strand : 

20 S'TAGACTCGAGGTCCTTTTTAGCATCGTCTTTCTTAGCGT CAT 

CmCTTCGCAG(:ATC(rrTTTTCGCATCGTCCGCACTAAGCTTTATA-3' 

The resulting annealed strand was digested with Hind iU/Xho I and ligated into 
the vector, thus generating the plasmid pSCFV UHH (shown in Figure 6). Plasmid pSCFV UHH 
expresses a biologically active, TAG-72 binding SCFV consisting of the Hum4 Vl and CC49 Vh- 

25 The expression plasmid utilizes the p-lartamase penP promoter, peaate lyase pe/B signal 
sequence and the penP terminator region. Different immunoglobulin light chain variable 
regions can be inserted in the Nco \-Hind\\\ restriaion sites, different SCFV linkers can be 
inserted in the Hind Ill-Xho I sites and different immunoglobulin heavy chain variable regions 
can be inserted in the Xho I Nhe I sites. 

30 F. Preoaration of oSCFV UNIM and oSCFV UHM 

pSCFV UNIM was created in conjunaion with pSCFV UNIH. It differs in that the 
DNA sequence coding for the CC49\/l is incorporated into UNIM as opposed to the DNA 
sequence coding for the Hum4 Vl. The sequence for murine CC49Vl is disclosed in PCT 
WO 90/04410. published May 3, 1990, and was produced by PCR using the oligos UNIM1 and 

35 UNIM2(-) as primers. The CC49 Vh and the vector fragment were obtained by the manner 
described above for pSCFV UNIH. The SCFV insert was generated by PCR SOE using oligos 
UNIMl and UNI4(-) to join the light and heavy chains and form a coding region for a negatively 
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charged fifteen amino acid linker. Following Nhe\ and A/col digestion and purification of the 

vector and insert, a ligation of the two pieces produced pSCFV UNIM. 

UNIM1: 

S'CAGCCATGGCCGACATTGTGATGTCACAGTCTCC-B' 

5 UNIM2(-): 

5'GAGGTCCGTAAGATCTGCCTCGCTACCTAGCAAAA 
GGTCCTCAAGCrrCAGCACCAGCTTGGTCCCAGCAC.3 

pSCFV UHM was generated by digesting pSCFV UNIM with HindlW andXhol. 
isolating the vector fragment with the Camr gene by 4 percent PAG, and ligating the 
10 twenty-five amino acid linker. UNIHOPE, into the vector DNA fragment piece. The procedure 
was the same as described for pSCFV UHH. pSCFV UHM contains DNA sequence coding for an 
scFv having the CC49 and CC49 Vh with a computer predicted pi of 8. 1 . Hence, this piasmid 
was named pSCFV UHM8.1 . Hence, this piasmid was named pSCFVUHMS.I 

The piasmid map of pSCFV UHH and pSCFV UHM are illustrated in Figure 6A. For 
15 pSCFV UHH, Vl is Hum4 Vl, the linker (L) is UNIHOPE and Vh is CC49 Vh- Forthe pSCFV UHM 
series, Vl is CC49 Vh, the linker is UNIHOPE and Vh is CC49Vh. For the pSCFV UHM(X) series, X 
indicatesthe pi ofthe monomer product (pi = 8.1, 5.8 or 5.2) as described herein. Figures 68 
and 6C illustrate the monomeric (scFv) and dimeric (Fv2) forms, respectively, ofthe products. 
The CDR regions are indicated schematically by the black bands. 
2Q The DNA and amino acid sequences of the scFv obtained from pSCFV UHH 

construrtions are given in Figure 7. The UNIHOPE linker sequence is indicated in bold and 
underlined, while the CDR regions are in bold and italicized. The amino terminus ofthe 
mature scFv product is indicated by + 1 . 

f. co/iAGI (Statagene, La JoHa, CA) cells were transformed with pSCFVUHHand 
25 pSCFV UHM8. 1 by following the manufacturer's instructions. Competent cells were thawed on 
ice and gently mixed by hand before 50 pL aliquou were dispensed into 1 5 mL polypropylene 
tubes. A fresh dilution of p-mercaptoethanol was added to the tubes for a final concentration 
of 25 mM. The cells were swirled gently every two minutes for ten minutes. Either 1 or 5 \xl 
of the prepared DNA was added to the tubes. The cells and DNA sat on ice for 30 minutes, were 
30 heat pulsed in a 42**C water bath for 45 seconds and put back on ice for two minutes. 50C 
medium (450 pL) (medium containing baao-tryptone, baao-yeast extract, NaCl MgCl2 and 
glucose) was added to each tube and the tubes were incubated at 37*C for one hour with 
shaking. Either lOOpLor 200 pL of transformed cells were plated on LB agar containing 
20 pg/ml chloramphenicol- Plates were incubated at Sy^C overnight. 
25 From each transformation, twelve to eighteen bacterial colonies were selected 

for overnight growth at BT^C in 2 mL of LB/CAM 20. A portion of the overnight culture (1.2 mL) 
was used for a small scale piasmid preparation following the boiling method adapted from 
Holmes and Quigley(Sambrooketal, 1989,supra). Piasmid DNA was digested with restriction 
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enzymes and the resultant fragments were size analyzed by agarose gel electrophoresis using 
ethidium bromide and long wavelength UV for visualization. The remaining 0.8 mL of 
overnight culture was sonicated (Soniprep 150, MSE Ltd., Sussex), then microcentrifuged for 
3 minutes to pellet the debris. The supernatant was reduced in volume by 50 to 80 percent 

5 using a Centricon 10 microconcentrator device (Amicon, Beverly, MA) and tested for biological 

activity against TAG-72. 

(G) Construction of pSCFV UHM5.8 and pSCFV UHM5.2 

Two Plasmids, pSCFV UHM5.8 and pSCFV UHM5.2 were constructed to express 
CC49 scFvs and CC49 Fv2s having computer predicted pi's of 5.8 and 5.2, respectively. Extra 
negatively charged amino acids were added to the carboxyl terminus of the CC49 Vh, by 
synthesizing two oligonucleotides (oligos) and using them for a polymerase chain reaction 
(PCR) amplification of the CC49 Vl - linker - CC49 Vh DNA present in pSCFV UHM8. 1 . These 
oligos for the 3' end of the target are as follows. For the 5.8 construction, a 58-mercailed 
SCUHPM 5.8 was made. For the 5.2 construction, a 64-mercalled SCUHPM 5.2 was made. 

15 SCUHPM 5.8 

5'-AACA GCTAGC TTT TTA GGA GTC ATA GTC CTC AGG GGA GAC GGT GAC TGA GGT TCC 

TTG-3' 

SCUHPM 5.2 

5' - AACA GCTAGC TTT *TA CTC ATA CTC TTC AGG GTC TTC AGG GGA GAC GGT GAC TG A GGT 
20 TCC TTG-3' 

The nucleotides representing Nhe I restriction sites are indicated in bold. These oligos were 
purified from a 14 percent polyacrylamide gel containing urea (Sambrook et al., supra). The 5' 
oligo for the PCR was a 34 mer designated UNIMl (sequence given above). 

The nucleotides representing an Nco \ restriction site are indicated in bold. For 

25 the PCR's, lOOpmol of the 5'(UNIM1) and 3' (SCUHPM 5.8) oligos were used to generate the 

CC49 scFv 5.8 DNA product insert, while 100 pmol of each of UNIMl and SCUHPM 5.2 were used 
to generate the CC49 scFv 5.2 DNA product insert. The target DNA for the oligos in the PCR 
reaction (100 uL) was 1.0 ng of pSCFV UHM8.1. The reaction was performed using a 
programmable thermal controller ( MJ Research Inc., Watertown, MA) and Taq polymerase 

30 according to the directions provided by Perkin Elmer Cetus (Norwalk, CT). There were 30 cycles 
consisting of 40 seconds at 94*C, 70 seconds at 45°C and 75 seconds at 72'*C. For the final cycle, 
the temperature was held at 72**C for 2 minutes. The PCR products were analyzed on a 
1 .0 percent agarose gel containing ethidium bromide and found to be the expected sizes of 
788 bp and 794 bp for the 5.8 and 5.2 CC49 scFv constructions, respectively. The DNA was 

35 preparatively purified from a 4 percent polyacrylamide gel, digested with Wco I (from BRL, 
Gaithersburg, MD) and Nhe i (from New England Biolabs) using New England BioLabs 10 x 
buffer #2 (lOmMTris-Hcl, 10 mM MgCl2, 50 mM NaCI, 1 mM OTT, pH 7.9). 
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The resulting trimmed Nco I and Nhe I oligonucleotides fragment of the correct 
size were purified away from the DNA inserts by elearophoresis in a 4 percent polyacrylamide 
gel. The CC49scFv inserts were electroeluted and ethanol precipitated (Sambrook). The vector 
for ligation with these inserts had been prepared from pSCFVBBH digested with Nco I and 

5 Nhe I. Competent £. co// AG 1 cells (Strategene, La Jolia, CA) were transformed with the ligated 
DNA and plated on LB plates containing 20 ug/mL chlorarr.phenicol. 

Four clones were chosen from each transformation to prepare miniprep plasmid 
DNA (Sambrook) for restriction enzyme analysis. Based on these results single clones were 
chosen; E. co/i AGI/pSCFV UHMS.Sf. co/iand AGl/pSCFV UHM5.2. 

^0 The DNA and amino acid sequences of the CC49 species, pSCFV UHM8.1 (A), 5.8 

(B) and 5.2 (C), are given in Figure 8. 

Preparation of Total Cellular Protein and Escherichia Coli Periplasimic Shockates 

For analysis of total cell CC49 scFv activity or preparation of samples for sodium 
dodecyl sulfate polyacrylamide gel elearophonesis (SDS-PAGE), £. co//{pSCFV UHM8.1; pSCFV 

1 5 5.8; or pSCFV 5.2) liquid cultures (2.5 mL LB supplemented with 20 jig/mL chloramphenicol) 
were grown overnight at 37°C. Cells from 1.5 mL of culture were pelleted in a microcentrifuge, 
the supernatant discarded, and the cell pellets resuspended in 0.5 mL phosphate buffered 
saline (PBS) (Sigma Chemical). Sample tubes were placed in an ethanol ice bath before 
sonication (MSE Soniprep 150, UK) using 3 cycles of disruption at 14 microns amplitude for 15 

20 seconds each and cool down for 45 seconds in between. The sonicated cell debris was pelleted 
in a microcentrifuge, and the supernatant, representing the total, soluble protein fraction, was 
filtered through a Millex-GV 0.22 p filter disc (Millipore, Bedford, MA). 

For purification of the CC49 scFvW derivatives, where x = 8. 1 , 5,8 and 5.2, E, coli 
periplamsic fractions were prepared from 1.0 L overnight cultures. A 1 .0 L culture was divided 

25 into 4x250 mL portions and centrifuged at 5,000 rpm, for 10 minutes in a Sorvall GS-3 rotor. 
The pelleted cells were washed and resuspended in 100 mL each of 10 mM Tris-HCl pH 7.3 
containing 30 mM NaCI. The cells were again pelleted at 5,000 rpm for 10 minutes. Cells were 
washed with a total of 100 mL of 30 mM Tris-HCI (pH 7.3) and pooled into 1 tube. To this was 
added 100 mL of 30 mM Tris-HCI pH 7.3 containing 40 percent w/v sucrose and 2.0 mL of 10 mM 

3Q EDTA, pH 7.5. The mixture was kept at ambient temperature, with occasional shaking for 
1 0 minutes. The hypertonic cells were centrifuged as above. The cells where then quickly 
resuspended in ice cold 0.5 mM MgCl2 and the suspension kept on ice for 10 minutes with 
occasional shaking. The cells were centrifuged as above and the shockate, containing the 
f . co/i periplasmic fraaion was further clarified by passing through a 0.2 ja filter apparatus 

35 (Nalge Co., Rochester, NY). This material was concentrated using Centriprep 30 and Centricon 
30 ultrafiltration devices (Amicon Beverly, MA) to a final volume of 1-2 mL. 



.24- 



BNSDOCID: <WO 940469 1A2 J , > 



wo 94/04691 



PCr/US93/07918 



For isolation and analysis of just cytoplasmic proteins, previously shocked f. co/i 
cells (0.5-1 .0 mg - wet weight) were resuspended in 0.5 mL PBS and sonicated as above. The 
cytosol was recovered by microcentri-f ugation and filtered through a 0.2 ji filter device. 

Prior to further purification, the shockate was dialyzed against 50 mM sodium 
acetate buffer, pH 4.95, using a System 500 microdialyzer unit (Pierce Chemical, Rockford, IL), 
with 3-4 exchange of buffer:^ An 8,000 molecular weight (MW) cut-off membrane was used. 
Purification of CC49 Single Chain Monomer and Dimer Species 

The general purification of the single chain antibody species is given in Scheme 1. 
The procedure under column 1 was initially used for purification of antibodies from pSCFV 
U HM8. 1 and was later modified as given under column 2. Single chain antibody species derived 
from pSCFV UHM5.2 were purified as given under column 3. Final purification of the 
pSCFV UHM5.8 produrts was not completed. 
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10 



A. CC49 scFv 8. 1 

For single chain antibody species from pSCFV UHM8. 1 , about half of the £. co/i 
AGI/pSCFV UHMB.l periplasmic shockate prepared from 1 Lof culture as described above was 
dialyzed overnight against 50 mM sodium acetate buffer, pH 4.95. Precipitated proteins were 
pelleted by microcentrif ugation and the supernatant was applied to a cation exchange column 
(Mono S HR5/5 HPLC column, Pharmacia, Piscataway, N.J.). A linear gradient program utilizing 
50 mM sodium acetate pH 4.95 as buffer A and 50 mM sodium acetate pH 4.95 containing 0.3 M 
NaCI as buffer B was run as follows:. 



15 
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When the periplasmic fraction was chromatographed over Mono S, two areas of 
activity were observed (Figure 9) and the two pools kept separate for the next step. The lanes 
1 -9 for SDS-PAGE correspond to a crude periplasmic fraaion (lane 1)and fractions 1,2, 3, 12, 16, 

25 2 1 , 26 and 43, lanes 2-9 respectively, with a Western transfer of the samples from a duplicate 
gel, indicate produa in fractions 26 and 43 (lanes 17 and 18). 

Pool 2 from the Mono S purification was concentrated to below 250 \il using a 
Centricon 30 device and diaiyzed against buffer A (20 mM Tris-HCI, pH 7.60) for 
rechromatography on the Mono Q HR5/5 anion exchange column (Pharmacia, Piscataway, N.J.). 

30 Buffer B was the same as buffer A, but in 0.5 M NaCI. The gradient program used was the same 
as for the Mono S run above, but for this Mono Q run the Mono S pool 2 sample was changed 
to 100 percent B the gradient was at 16 percent B. 

The anion exchange chromatography and SDS-PAGE of the Mono S Pool 2 sample 
is given in Figure 10. The arrow on the chromaxogram indicates the point in the gradient of 

35 Buffer B (16 percent) where the percent of B was changed to 100 and held for 5 minutes. Lanes 
1-9 correspond to fractions 1-3 of a similar chromatography (not shown) done on Pool 1 
(Figure 9); a sample of Pool 1; a sample of Pool 2; and fraaions 1-4 from the anion exchange 
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chromatography of Pool 2. Lanes 10-18 represent the same fractions as in lanes 1-9 in a 
Western transfer fr m a duplicate gel. 

The final step in the purification utilized a Superdex 75 HR10/30 gel filtration 
column (Pharmacia). Potassium phosphate buffer (0. 175 M, pH 6.85) was used as the mobile 

5 phase, at a flow rate of 0.2 mUmin. Fraction 1 from the Mono Q purification was applied to the 
Superdex column and the resulting chromatogram is shown in Figure 1 1 A-1 . Both peaks 
emerging at 42 and 50.5 minutes were active in the competition assay, with considerably more 
aaivity being observed in the first peak (42 minutes). On SDS-PAGE and Western transfer 
(Figure 1 1 B) a band for the correct molecular weight of monomeric single chain antibody was 

10 observed. Lanes Bl and 8 are ChCC49 Fab; lanes 2 and 9, fraaion 14; lanes 3 and 10, fraction 
15; lanes 4 and 11 fraaion 16; lanes 5 and 12. fraction 17,; lanes 6 and 13, fraction 18; and 
lanes 7 and 14, fraaion 19. Application of a ChCC49 Fab fragment to the gel filtration column 
(Figure 1 1 A-2), using the same conditions as for the scFv sample, indicated that the scFv peak 
emerging at 42 minutes contained a protein that was larger than the Fab, which emerged at 

15 45.8 minutes. When the peak emerging at 42 minutes was subjerted to the reducing condition 
of 5D5-PAGE, a monomeric unit on the gel and Western was seen, indicating a con-cova lent! y 
linked dimeric (Fv2) form of the single chain antibody. Further evidence of a unique dimeric 
entity was obtained when an lEF gel was run. 

lEF of frartions 15 (lanes CI and C3) and 17 (lanes C2 and C4), indicates there is a 

20 distina difference between the pi of the Fv2 (lanes 1 and 3) and the monomeric scFV (lanes 2 
and 4). Samples in lanes 1 and 2 were stained with Coomassie Brilliant Blue R-250, while lanes 3 
and 4 represehta Western transfer of the corresponding lanes of a duplicate lEF gel. The faint 
FAID1 4 positive band in Figure 1 1C, lane 3, above the main Fv2 band is not known. It could be a 
trace degradation product or some higher multimer. 

25 A second purification system was then developed to optimize the purification of 

the monomer, dtmer and multimer species of the pi 8.2 single chain antibodies. The procedure 
is outlined in column two of Scheme 1 . The buffer system for purification with the Mono S 
cation exchange column was also changed to 50 mM sodium acetate pH 5.2 and 50 mM 
2-{N-morpholino)ethane sulfonic acid (MES). 

3Q The protocol which afforded pure CC49 scFv and Fv2 8.2 was as follows: A 

periplasmic fraaion was dialyzed and applied to mono S using a 50 mM sodium acetate buffer 
system, as before, but at pH 5.20. At this pH less protein precipitated than at pH 4.95 but the 
results were much the same in terms of the chromatography. Next, the Mono Q column was 
utilized as above. After this, a second Mono S run was introduced for the aaive MonoQ 

35 frartions. The buffer system used was: buffer A (50 mM MES, pH 5.6) and buffer B being the 
same as buffer A in 0.5 M NaCI. The monomer and dimer were finally purified using the 
Superdex 75 HR10/30 column and PBS as the buffer. 
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B. CC49 scFv5.2 and Fv2-5.2 

The periplasmic shockate from a 1 .0 liter culture was prepared as described 
above. The strategy for purification used here was to separate the monomer from the dimer in 
a gel filtration step first. A Superdex 75 prep grade HR 26/60 preparative column was used. The 
5 concentrated crude sample (0.5 mL) was chromatographed at a flow rate of 1 .3 mUmin, using 
0.175 M potassium phosphate buffer pH 6.8. See Figure 12for the chromatogram. The elution 
times for the molecular weight standards: (1) bovine IgG (Pharmacia), MW - 153 Kd; 
(2) chicken ovalbumin, MW = 44 Kd; and (3) horse myoglobin, MW 17 Kd from a separate run 
are as indicated by the arrows. 

SDS-PAGE analysis of the fractions was as follows: lanes 1 and 14, ChCC49 Fab; 
lanes 2-13 correspond to fractions 3, 8, 10, 12, 14, 16, 18, 19, 20, 22, 23 and 24 respectively and 
lanes 15-24 correspond to fractions 25-33 and 35, respertiveiy. 

Both the resulting dimer and monomer pools were concentrated to 300 pL with 
centricon 30 devices and dialyzed overnight against 20 mM ( 1 ,3-bisltris(hydroxymethyl)- 
15 methyiaminolpropane (Bis-Tris propane)-a pH 6.8 buffer and applied to the Mono Q HR5/5 
column for the monomer and dimer purification. The same gradient program was used as 
described earlier with buffer B being 20 mM Bis-Tris propane-CI pH 6.8 in 0.5 M NaCI. The 
active Mono Q fractions were pooled and concentrated to 1 30 The sample was applied to 
the Superdex 75 HR 10/30 column to verify purity of the monomer and exchange the buffer to 

20 

Antibodv Competition 

Using purified monomer and dimer, it was possible to quantitate them accurately 
(Table 1 ) and compare them directly in a competition assay^o a Fab, which is monovalent, and 
the CC49 whole antibody, which is divalent. The results of this study are given in Figure 13. The 
25 results indicate the dimer and whole IgG have similar activity. These results further support the 
data of the existence of the dimer, indicating the dimer is folding in a way that gives two 
active, accessible binding sites. 



30 



35 



-29- 



BNSDOCIO: <WO 940469 1A2 I . > 



wo 94/04691 



PCr/US93/07918 



c 
< 



u 
u 



c 
o 



(0 
N 

u 

0) 

u 



U > 



u > 



USX 

u > u 



m 

u 
a 



o 



> 



> X 

Gb X 



CM fM 

Cm in 



> X rst 
O =5 in 



CM 00 

> • 
U« tn 



> ® 
U ± in 



> 



> 
%J 



o 

E 
O 



o 

E < 



o 1-1 n z: 



o ^ ^ 

CN ^ CM 



CM CN o rvi 2 
rn n ^ m 



^ m ^ «H "* f-i J^' 



00 XO ' 

m ^ ^ ( 



<>• CD <\I ^ V 



000_0<NrNOO 



o 
in 



t-t cn r-i x: t — t >i o 0) CO ui-H u a>x: a >n cl 
I I I I I I I I I I I t I I I I I I I I £: 



00 



o 
c 





• 


CO 






0) 




m 


3 




w 


•o 




*o 






>* 


CO 






0) 




o 






JO 








Ifl 






>1 












c 


• 




0) 


w 


a 


a> 


c 


c 


3 


o 


o 






^ 


0) 




f-« 




(d 






■o 




c 






o 


»H 


CI 




00 






O 




cd 






E 






M 






o 








V) 








at 


TJ 








cn 








a 


^ 


c 






01 


c 




c 


9 


"O 


o 


C 


-1-1 




O 


«u 


B 


a 


f-H 


o 


E 


3 


u 


o 


CO 




u 


-•-1 


a> 




T3 


to 


« 




O 


a 




x: 




O 














E 




•o 


o 


E 






O 






ki 


to 












-r-» 








b 




« 




o< 




cn 




0) 








u 




CO 


u 






SI 


u 


c 


u 












0) 


o 




x: 
















» 




UJ 




O 


o 


• 3 








IS 




O (0 


X 


X 


> 








0} 
















c 




CN 





-30* 



SNSOCXID <WO 9404691A2 I > 



wo 94/04691 



PCT/US93/07918 



. The activity of the purified CC49 scFv5.2 was compared with the ChCC49 Fab in a 
competition assay (Figure 14) and the scFv was found to be equivalent to the Fab. This indicates 
one can add certain features to the carboxyl terminus of a scFv construction, as in this case the 
addition of Pro-Glu-Asp-Pro-Glu-Asp-Tyr-Asp. that do not affect the binding site. Therefore, 
5 other modifying sequences can be contemplated which alter the overall characteristics of the 
molecule. 

Amino-Termina! Amino Acid Analysi s 

The amino terminal sequencing data gave the following result: NH2-7-I-V-M-S-Q- 
S-P-S?-T?-L. The first residue couldn't be unequivocally assigned, but its positioning indicates 

10 that signal peptidase had cleaved at the end of the pe/B signal sequence, and released a 

corrertly processed mature product. The residues indicated at positions 2-1 1 correspond to the 
sequence prediaed by the DNA sequence (See Figure 8), The yields for Ser and Thr at positions 
9 and 10 were low, but in general, Ser and Thr yields are known to be lower than for the other 
amino acids. Based on these data, one can conclude that the pel B signal sequence is effective 

15 in directing the secretion of the single chain products to the £. co/i periplasm. It is apparent 
that one need not be limited to using the pel B leader, and that the same results, as far as 
processing and secretion of the product is concerned, can be obtained by using other signal 
sequences such as omp A and pen PC, and others. One can include these sequences in the PCR 
oligo when making the construction upstream from the mature protein. 

20 Temperature Stability Study of CC49 Fv2 8.1 

Approximately 3 pg aliquou of CC49 Fv2 8. 1 kept at 37*C were analyzed at 
intervals over 24 hours by gel filtration chromatography. A Superdex 75 HR1 0/30 column was 
utilized, using PBS as the mobile phase, at a flow rate of 0.5 mL/min. 

The results of samples taken at various time points over 24 hours at 37°C indicated 

25 that the Fv2 remained intact (eluted at the same position as the starting material), and that no 
degradation products were observed. 
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Example 2 Cloning of CC49 Si ngle Chain Antibody Species into the Yeast Pichia pastoris 

For expression of single chain antibody species in yeast, the strain Pichia pastoris 
GTS115 H/54 (NRRLY- 15851) carrying a defective histi dine dehydrogenase gene was used as a 
host cell. The plasmid pHIL-SI , available from Phillips Petroleum Company, was used as the 
5 expression plasmid modified as described below. The original sequence and features in the 5' 
region of pHIL-Sl are as follows: 

^ 5'AOXl ^ PHOXl Signal Sequence 

5 ' - , . > TT ATTCGAA ACG ATG TTC TCT CCA 

Bst B I 

ATT TTG TCC TTG GAA ATT ATT TTA GCT 
^ PHOXl Signal Sequence ^ 

TTG GCT ACT TTG CAA TCT GTC TTC GCT '' CGA 

Xba I 

15 Multi Cloning Sites ^ 3 ' AGXl 

GAA TCC CCC GGG ATC C TT AGA CAT ....-3' 
EcoR I Sma I BamH I 

indicates the signal peptide cleavage site. 

20 

DNA Sequencing 

The dideoxy chain termination nnethod of Sanger (Sambrook et al. supra) was 
used for DNA sequencing of the CC49 scFv genes present in pPY21 and pPY22. A Sequenase*" 
sequencing kit (United States Biochemical, Cleveland, OH) was used for the sequencing 
reactions, according to the manufacturer's directions. The radiolabel used was a-P32 dATP 
(3,000 Ci/mmol) obtained from Amersham (Arlington Heights, IL), lot no. AC9048. Double 
loadings were done using a BRL Model S2 sequencing apparatus and a premixed 6% Get-Mix 
sequencing gel (BRL). After elearophoresis, gels were dried for 30 min using a Model 483 Slab 
Gel Dryer (Biorad, Richmond, CA) and exposed to Kodak X-OMat film (Sigma Chemical. St. 
Louis, MO) for 1-24 h. The sequencing oligonucleotides used for priming (0.5 pmo! each) were: 
AOX1 P (23-mer), 5'-TTTAACG ACAACTTGAGAAG ATC-3' ; 
TAGVLFR2 (20-mer), 5'-TGGTACCAGCAGAAACCAGG-3'; 
TAGVLCDR3 (Mixed 22-mer), 
5'-GTCAGCA(AG)TATTATAG{CT)TATCC-3'and; 
TAGVHCDR2 (21-mer), 5'-ATGGATTGGATATTTTTCTCC-3' 
Construction of pPYI 

A clone of £. coli AGl transformed with pHIL-Sl was used to isolate purified 
pHIL-SI plasmid using Qiagen's midi plasmid prep procedure (Chatsworth, CA). Three 
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micrograms of purified pHIL-Sl were treated with Sst I (Biological Research Laboratories. BRL) 
(10 units) in a volume of 40 pLfor 45 minutes and then 2 iiL of 1 .5 M NaCI and 20 units (2 pL) of 
Sal I (BRL) were added (see Figure 1 5A). After the reaaion proceeded for about 45 minutes, the 
sample was loaded onto a (16 X 18 cm) 4 percent potyacrylamide gel (PAG). 
5 In a separate reaction. pBluescript II KS + (5 pg) purchased from Stratagene, was 

treated with Sst I (12.5 units) in a volume of 40 tiLfor45 min. (see Figure 15B). The buffer was 
then modified to favor Sal I aaivity. by adding 2X of 1 .5 M NaCI. and then 25 units of Sal I were 
added. After another 45 min at 3TC. this sample was also loaded on the 4 percent PAG. DNA 
fragments were separated by electrophoresis at 90 volts over 2.5 h. 

pHlL-SI gave 2 fragments, a larger one at 5.9 Kbp containing the ampicillin 
resistance gene and a smaller fragment at 2.5 Kbp containing most of the 5'AOX1 region and 
multiple cloning site. Both fragments were cut out of the gel and isolated by elearoelution 
and ethanol precipitation. 

For the pBluescript II digest, 2 bands were visualized, one corresponding to the 
^ 5 vector piece at about 2.8 Kbp and a small fragment from the multiple cloning site. The 2.8 Kbp 
fragment was recovered as above. TomakepPYI (Figure 15Q, the pBluescript II Sst l«a/ 1 2.8 
Kbp fragment (-1 00 mg) was used in a ligation reaction with the 2.5 Kbp pHIL-S1 fragment 
(-100 mg). The ligation kit components were from Stratagene. while the T4 DNA ligase was a 
New England Biolabs product. Ligation commenced at 18"C, with a cooling gradient down to 
20 4'>C overnight. £. coV/SURE (Stratagen. LaJolla, CA) was transformed with this ligation mix (1 pL 
and 5 pL samples). After overnight growth on Luria Broth (Sambrook et al.. Supra) plates 
containing 100 pg/mL ampicillin, uncolored colonies (plasmids containing inserts) out- 
-numbered blue colonies (without inserts) by roughly 10:1. Five of the uncolored colonies were 
picked for plasmid screening with Sst I and Sal I. All 5 gave the restriction pattern expected 
25 based on the plasmid construction (2 bands at 2.8 Kbp and 2.5 Kbp). The plasmid from a clone 
designated #4 was tested further and found to have the correct sites for BstB I, Xho 1 (2 sites), 
£coR V and BamH I. and was thus picked as the representative clone for pPYI . 
Construction of pPY2 

Using the plasmid mini prep DNA of pPYI from above, 2.5 pg were treated with 
30 20 units of £coR V (BRL) in a reaction volume of 45 pL at 37»C for 30 min. A single linearized 
DNA fragment at 5.3 Kbp was purified from a 3.75 percent PAG as described above. A linker 
DNA fragment with single £coR I and Not t sites was then ligated into the blunt ended EcoR V 
site as follows. 

The oligos ECONOT, a 20-mer: 5'-GAATTCTTAGCGGCCGCrTG-3' for the top 
35 strand and TONOCE, a 20-mer: 5'-CAAGCGGCCGaAAGAATTC-3' for the bottom strand were 
synthesized and purified for annealing. A 60 fold molar excess of annealed linker over the f coR 
V treated pPYI fragment (5.7 X 10-i^ moles -20 mg) were ligated using T4 DNA ligase and 
temperature conditions as above. Competent E. co/i AG1 cells were transformed with aliquots 
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(1 and 5 pL) of this ligation mix and plated (75 or 150 pL) onto LB agar plates containing 
100 pg/mLampicillin. Twelve clones were picked for plasmid screening with EcoR I. Out of 
these, 1 0 clones had the correct two £coR I fragments of roughly 600 bp and 4.7 Kbp. Four of 
these clones were selected for DNA sequencing of their plasmids for verification and 
5 orientation of the linker sequence. The oligo used for the sequencing (0.5 pmol) was 

ECOVNHESEQ,a 19-mer: 5'-TGCGCATAGAAATTGCATC-3'. Twoof the plasmids had the correct 
sequence and desired orientation, one was chosen as the representative pPY2 clone (see Figure 
15D). 

Construction of pPY21 and pPY22 
-|Q A common vector derived from pPY2 was used to generate both pPY21 and 

pPY22. Two micrograms of pPY2 mini-prep DNA were treated with 24 units of BstB I (New 
England Biolabs) in a 20 pL reaction volume at 65*C for 1 5 min. At this point, 1 .5 M NaCI was 
added (1.2 pL) with 12 units (1.2 pL) of 8amH I (New England Biolabs) and the reaction allowed 
. to proceed for 1 5 min at 37''C. This sample was loaded on a 4 percent PAG, and the pPY2 
1 5 fragment with BstB I and BamH I ends of roughly 5.2 Kbp was recovered by electroelution and 
ethanol precipitation. 

The PCRsto generate 2 CC49 scFv inserts for the pPY2 vector were performed as 
follows. Oligos (100 pmol each) targeting pSCFV UHM5.2, prepared as described in Example 1 
(5 ng) were: 
20 1) PY49VLI, a 92-mer, 

5'-ACATTTCGAAACGATGCrrTTGCAAGCTTTCCTTTTCCT^ 
GTTTTGCAGCTAAGATATCTGCTGACATTGTGATGTCACAGTCrrC-3'; and 

2) pPY523, a 36-mer. 

5'-AAATGGATCCTATTAGTCATAGTCTTCAGGGTCTTC-3'. 
25 Taq polymerase was purchased from Perkin Elmer Cetus (Norwalk, CT) and used in conjunction 
with the GeneAmp" kit, according to the manufacturer's directions. The conditions for the 
PCRwere: 30 cycles of 94°C for 30 sec, 52°C for 30 sec and 72*0 for 1 min. On the final cycle, the 
last elongation step (72''C) was 2 min. An MJ Research (Watertown, MA) thermal cycler was 
used. 

3Q For the PCR of the second CC49 scFv insert, oligos (100 pmol each) targeting 

pSCFVUHMS.I (see example 1)(5ng)were: 

1) PY49VL1, as above; and 

2) PSC49VH,a40-mer, 
5'-ATGTGGATCCCTATTAGTAGGAGACGGTGACTGAGGTTCC'3' 

35 The reaaion was set up and carried out as described for pSCFV UHM5.2, above. The PCR 
products were purified free of reactants by electrophoresis in a 4 percent PAG. The desired 
bands from the pSCFV UHM5.2 and pSCFV U HM8. 1 targets were visualized at the expeaed 
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875 bp and 854 bp sizes respectively. The DNA was recovered by electro lution and ethanol 
precipitation. 

The PCR inserts were prepared for ligation by trimming the ends off of each with 
BstB I at the 5' ends and BamH I at the 3' ends, using the same reaction conditions as described 
5 above for the pPY2 veaor. After this treatment, the DNA inserts were again purified using a 
4 percent PAG, subsequent electroelution and ethanol precipitation. The DNA inserts 
generated from pSCFV UHM5.2 and pSCFV UHM8.1 were used to make pPY2r and pPY22, 
respectively (Figure 15E). The ligation reactions regenerating pPY21 and pPY22 were set up 
(20 pL total volume) using 1 00 mg of pPY2 vector for each, and a 1 : 1 stoichiometric equivalent 
10 ofthe prepared PCR inserts from P5CFVUHM5.2 and pSCFVUHMS.l respeaively. Theligation 
kit components and T4 DNA ligase were from Stratagene and used according to their 
direaions- The ligation reaaions were incubated at 1 5^C for 2 hours, and 2 \xl from each used 
to transform E. coli AG 1 competent cells. Samples were plated onto LB agar containing 100 
pg/mL ampicillin. Seven clones each for pPY21 and pPY22 were picked for screening of their 
1 5 plasmid DNAs, using the BstB l-fiamH I double digest described above. A single clone (#6) was 
obtained with the correa DNA fragments for pPY21, while four clones (#11-14) were obtained 
for pPY22. The clone for pPY21 and pPY22 clone #12 were selected for DNA sequencing ofthe 
insert DNAs containing the CC49 scFv construrtions. Both clones contained the correct 
sequence for CC49 scFv, 5.2 (pPY21) and 8.1 (pPY22), respectively, and were used for the final 
2Q expression plasmid constructions. 

The DNA and amino acid sequences of CC49 scFv in pPY2 1 and pPY22 are given in 
Figure 1 6. In these scFv constructions, the Vl gene came first followed by the Vh and was joined 
by a 25 amino acid linker which is in bold and underlined. The oligonucleotides used as DNA 
sequencing primers are shown above the DNA sequence lines. Restriction enzyme sites are as 
25 indicated, and underlined. TheSUC2 signal peptide is underlined; the putative first amino acid 
of these mature products is indicated by ( + 1). Complementarity determining CDRs regions 
segments for the CC49Vl and CC49Vh are italicized in bold. Sequences for pPY2 1 , giving 
negatively charged proteins (CC49, scFv5.2, Fv(2n)-5.2) are indicated by sequence A; and 
sequences for pPY22, giving close to neutrally charged scFv products (CC49. scFv8. 1 P, 
30 Fv-(2n)-8. 1 P) are represented by sequence B. 
Construaion of pPY31 and pPY32 

The Sst l-Sa/ 1 DNA fragments of pPY21 and pPY22 containing the CC49scFv 
constructions were isolated and purified, to be used as inserts with the pHIL-SI vector fragment 
(5.S Kbp) already described above. Three micrograms of pPY21 and pPY22 plasmid mini-prep 
35 DNAs from above (same clones as were sequenced) were treated with 9 units each of Sst l(BRL) 
in a 30 reaction at ST^'C for 20 min. The samples were elearophoresed on a 4 percent PAG. 
The resulting bands at 3.2 Kbp (containing the DNA of interest) and at 2.8 Kbp were isolated 
together, since the resolution was not adequate for good resolution on the PAG. These pairs of 
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DNA inserts were then ligated with the pHIL-SI Sst l-Sa/ 1 vector fragment (100 mg) in a 1 : 1 
stoichiometric ratio, in a 20 pL reartion volume for 2h at 1 5»C to form pPY31 and pPY32 
(Figure 1 5F). Competent £. coli AG 1 cells were transformed with 2.5 pL aliquots of the ligation 
mixes and plated onto LB agar plates containing 100 pg/mL ampicillin. Six potential clones each 
5 of pPY31 and pPY32 were selected for plasmid screening. The resulting mini-prep DMAs were 
digested with an Sstl-Sa/I double digest, as described above for the insert preparation. Three 
clones had the correa 2 bands (3.2 Kbp and 5.9 Kbp) for pPY31 . one was chosen as the pPY31 
representative. Two clones had the correct bands for pPY32 and one of them was chosen as the 
representative for transformation into Pichia. 
Transformation of Pichia oastoris GTS 1 1 5 with pPY3 1 and oPY32 

The procedure of Becker and Guarente for [Methods in Enzymology, 194^ 182 
(1990)]. elearoporation of yeast cells was followed with some modifications, as described 
below, to introduce the CC49 scFv genes into P/ch/a. Yeast extract peptone glucose (YEiD) 
medium (125 mL) was inoculated for overnight culture at 30-C from a stock of P. pastoris 
15 GTS1 1 5 stored at A'C. The cells were harvested the next day when the O.D.fioonm was between 
0.8-0.9. After washing with sterile water and 1 M sorbitol, the cells were resuspended in ice- 
-cold 1 M sorbitol (150 pL) and held on ice. Aliquots of pPY31 or pPY32 (2 pg/pL) linearized 
with Bgl II. totaling 5 pg and 10 pg were added toAOpLeach of the Pichia electro competent 
cells and transferred to pre-chilled (on ice) 0.2 cm sterile elearoporation cuvettes (Biorad). 
20 Each sample was pulsed at 1.5 kV. 25 pF and 200 ohms, using a Biorad Gene Pulser with Pulse 
Controller. The time interval of the pulsers was 4.7-4.8 msec. Immediately after each pulse, 
1.0 mL of cold 1M sorbitol was added to the cuvette, and three RDB plates for each of the 4 
transformations (see U.S. Patents 4,855,231. 4,879.231 and 4.808.537) were plated (200 pL 
each). Plates were incubated at 30»C. Four days later, 36 colonies (numbered 1-36) transformed 
25 with pPY31 and 35 colonies (numbered 37-71) transformed with pPY32 were randomly picked 
and inoculated into 2.0 mL of BMGY media and streaked on MD and MM plates. All were 
grown at 30''C. BMGY is buffered minimal glycerol-complex medium containing 1 m potassium 
phosphate (pH 6.0); 1 3.4 g/L yeast nitrogen base with ammonium sulfate; 400 pg/L biotin; 
10 g/L yeast extract; and 20 g/L peptone. MD is a medium containing 13.4 g/L yeast nitrogen 
30 base; 400 pg/L biotin; and 20 g/L dextrose. MM is the same as MD with 5 mUl of methanol 
substituted for the dextrose. 

Clones able to grow on MM pates are not AOX; implying that the Bgl II fragment 
did not incorporate at the AOX) locus, but at some other chromosomal location. The MM 
plates have only methanol as a carbon source. After 2 days of growth in the BMGY medium. 9 
35 potential AOX mutant lines (observed by slow growth on the MM plate. 3 for PPY31, 

designated Nos. 4. 31 . 32 and 6 for pPY32, designated Nos. 39, 44, 60, 61 , 65 and 71 and an 
AOX' control (Clone No. 1 1, from the pPY31 set) were transferred to BMMY medium (2.0 mL 
each). BMMY medium is the same as BMGY with 5 ml/L methanol substituted for glycerol. 
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Aiiquots of methanol (20 pL) were added to each tube, roughly every 10-14 h. Samples (100 pL) 
from each culture were taken 32 h after methanol induction and the biological activity 
determined by a competition assay described in Example 1. SDS-PAGE 10-20 percent gradient 
gels were run on 10 pL aiiquots taken 32 and 100 h post induction. Western transfer and 

5 detection by biotinylated FAID 14 was also done for the 1 00 hour samples. 

The relative anti TAG-72 activities were evaluated for the methanol slow (AOX-) 
clones, along with some normal growing clones on MM medium. Biotinylated CC49 was used 
as the competitor in a competition assay. The results are presented in Figure 17, where the 
percent competition for a given sample was calculated based on ODAosnm-ODAsonm readings of 

1Q the average of triplicate samples assayed using the following formula: 

zero competition - Sample = % competition 

zero competition - 100% competitiow 

The zero competition value was obtained from a 0.2 micron filtered BMMY medium sample, 

15 while the 100 percent competition value was obtained using a 25 pL sample of CC49 IgG at 
5 pg/mL. The Ptchta samples for assay were prepared by microcentrif ugation of the culture 
sample and filtration through a 0.22 p filter. For pPY31 the results indicated that the methanol 
slow clones 4 and 32 competed very well (Fig. 1 7) as did clone 31, which by observation of 
growth on the MM ptate could not be categorized as methanol slow. Clones 1 1 and 17 were 

20 producing little or noscFv. For pPY32, the methanol slow clones 39, 60, 61, 65 and 71 all gave 
excellent competition. Oones 38, 44 and 50 gave little or no competition. 

The SDS-PAGE-Western data corroborated the competition ELISA resulu. The 
main product of the crude Pichia culture supernatant observed on SDS-PAGE-Western was the 
scFv product. The Pichia products correlated well with the previously obtained E. coli product 

25 equivalents in terms of size on the SDS-PAGE and activity in the competition assay, in the case 
of the pi 8. 1 Pichia produas, a very slightly slower mobility was observed relative to the 
Western positive band representing the E coli CC49 scFv 18. 1 product. This is due to the 
increase in molecular weight of the Pichia product due to the carboxyl terminal tyrosine residue 
weighing approximately 78 daltons more than the corresponding serine residue at the same 

30 position in the £. co/i product. 

Purification of Sinole Chain Antibody Species from P. pastoris GTS1 15 dPY31 Culture 

Pichia pastoris carrying the plasmid pPY3 1 produces a CC49 single chain antibody 
having a pi of about 5.2. Starter cultures (20 mL BMGY medium) was inoculated (Clone #4 
above) and grown for 26.5 hours at 30''C. This was used then lo inoculate 800 mL BMGY 

35 medium in a 2.8 L Fernbach flask. The culture was grown at 30**C for 24 hours with shaking at 
ISOrpm. The P/chia cells wer pell ted at 10°C using 2 tubes and a G53 rotor (Sorvall) run at 
5,000 rpm for 10 minutes. The cells from each tube were aseptically resuspended in BMMY 
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medium (40 mL) to which 75 jiL of methanol were added and continued to shake at 30**C. 
Methanol (200 jiL) was added approximately every 12 hours over a 36 hour gr wth period. 

Culture supernatants (containing the single chain antibody products) were 
harvested by spinning the cultures at 5,500 rpm in a SS34 rotor (SorvatI) for 5 minutes. The 

5 supernatant (about 66 mL) was further clarified by filtering through a 0.22 p pore size filter 
device (Nalge) and then concentrated to 10 mL using a Centri prep 10 device (Amiccn). Half of 
this material was applied to a Superdex 75 HR26/60 gel filtration column (Pharmacia), at a flow 
rate of 1.3 mL/min using PBS as the buffer. The gel filtration profile (See Figure 18) gave three 
pools of single chain species indicating the formation of monomeric single chain antibody 

10 (scFv, fraction 19-23), dimers(scFv2, fractions 13-15) and multimeric forms (scFv(2n) where n>l, 
fractions 8-10). On the SDS-PAGE insert, lane "S" indicates the identical purified protein 
standard CC49 5cFv5.2 derived from E, co//p5CFV UHM5. 2 (Example 1), In the processing of 
running SDS-PAGE, the non-covalent interactions in a dimer or multimer are caused to break 
apart, showing that the dimer and multimer forms correspond to the same size as the basic 

^5 monomeric unit. 
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Each of th single chain antibody pools was further purified by us of a Mono Q 
HR5/5 anion exchange column as described in Example 1. Eachof the various species was then 
re-chromatographed on an analytical gel filtration column (Superdex 75 HR1 0/30, 
Pharmacia-LKB). Table II gives the elution time, elution volume, Kav values and molecular 

5 weights determined for the monomeric and various multimeric forms. The predicted molecular 
weights were predicted by DNASTAR^ protein Titrate computer program using the amino acid 
composition of the polypeptide. 

The Anion Exchange Chromatography and 5DS-PAGE are given in Figure 19. 
Relevant parts A, B, and C correspond to the monomer, dimer and higher multimer forms of 

10 the scFv species, respectively. The buffer system used and the gradient of Buffer B are also 

shown. TheSDS-PAGE insert (D) has the following samples: molecular weight markers (lane 1); 
monomer and dimer (lanes 2 and 3); and E. co// derived CC49 scFv 5.2 standard (lane 4); 
tetramer. hexamer and putative ortamer (lanes 5, 6, and 7, respectively) and an approximately 
34 Kd impurity (lane 8) derived from fraaion 45. 

-| 5 The lEF gel and Western, shown in Figure 20 indicate that the monomer and 

various multimer forms have distinct pi values. Both the E. co// and P/c/)/a derived monomer 
forms, have a pi of 4.92 (derived from a graph of distance from cathode vs. pi of the standards), 
with a minor band at pi 4.97, which is the same as the pi for all multimers. That 2 bands seen in 
the monomer (lanes 3 and 9) support the hypothesis that there are two charge forms for the 

20 monomer in equilibrium, where the pi 4.97 form has a tendency to dimerize. In Figure 20, lanes 
1-7 represent bands stained with Coomassie Brilliant Blue R-250 and lanes 8-13 represent a 
Western transfer of samples corresponding to lanes 2-7. The pi markers (lane 1) were from 
Sigma: soybean trypsin inhibitor, pi 4.55; bovine p-laaoglobulin A, pi 5.13; and human 
erythrocyte carbonic anhydrase B, pi 6.57. Lanes 2 and 8 show purified CC49 scFv5.2 derived 

25 from £. co// pSCFV UHM5.2; lanes 3 and 9 show the same product derived from Ptchia. Lanes 4 
and 10 show a sample of crude culture supernatant from Pichia containing alt the CC49 single 
chain antibody species present. Lanes 5 and 11,6 and 1 2, and 7 and 1 3 indicate the Pichia 
dimer, tetramer and hexamer species, respectively. 

Purification of Single Chain Antibody Species of Pichia pastoris GTS1 1 5/pPY32 
3Q R pastoris having the plasmid pPY32 produces CC49 single chain antibodies 

having a pi of about 8. 1 . The growth; harvesting and induction of the AOX1 promoter were 

the same as for the pPY31 clone above. 

The purity of the 8. 1 monomer and dimer (multimeric forms were not analyzed) 

was verified by SDS-PAGE. The lEF pattern of these molecules indicated that except for 
35 carboxyl terminal Ser residue present in the £. co// derived CC49scFv8.1 and Fv28.1,the 

corresponding Pichia products have the same pi values. In Pichia the carboxyl terminal residue 

is Tyr. 
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The N-terminal amin acid sequencing results indicated that the invertase signal 
sequence used in constructing these single chain antibodies was accurately being removed by 
signal peptidase. For the CC49 scFvS.I P and Fv2-8. I P proteins, the following NH^ terminal 
residues were determined: Asp-lle-Val-Met-Ser-Gln-Ser-Pro-Ser-Ser, which matches the 
5 predicted amino acid sequence of the mature form, from the DNA sequence (see Figure 16). 
Forthe CC49 Fv2-5.2 product, the same results were obtained as above, except that the first and 
tenth amino acids could not be unequivocally assigned. Based on'the above results with the 
8. 1 pi system, it is reasonable to assume for the 5.2 pi products that the unassigned residues are 
the same. 

^0 From the competition ELISA results (Figure 21 A and 2 IB), it can be observed that 

the avidity of the single chain species, increases with higher multimer forms. It is possible that 
the effects seen for the tetramer and hexamer are underestimated and that different 
concentrations or densities of TAG-72 on the ELISA plate may influence the results. The 
availability of antigen to some of the binding faces of the hexamer and octamer, for example, 

^ 5 may not be sterically aval lable. 

Example ^ Biodistribution Characteristics of '"I Labeled CC49 Single Chain Fv Spedes 

The biodistribution of four CC49 single chain antibody species, two monomeric 
species: CC49 scFv with a pi of 8. 1 obtained from Pichia (scFv 8. 1 P) and CC49 with a pi of 5.2 also 
obtained from Pichia (scFv 5.2); and two dimeric species: CC49 Fv2 with a pi of 8. 1 obtained 

20 ^rom Pichia (Fv2-8. 1 P) and CC49 Fv2 with a pi of 5.2 obtained from £. coK (Fv2-5.2), was 
determined. This was compared with ChCC49 Fab and IgG. The single chain species were 
labeled with using the lodo-Beads"* (Pierce, Rockford. IL) method using N-chloro- 
benzenesulfonamide derivatived polystyrene beads (lodo-Beads™, Pierce, Rockford, IL). To 
label the antibodies, initially 3 lodo-Beads*" were incubated at room temperature for 5 minutes 

25 in PBS and ISOyCi carrier-free 1-125 (Nordion, 100.9 pCi/pL). At this point, the antibody (0.3 mg 
in PBS) was added and allowed to react for 8 minutes. Unincorporated isotope and reartion 
by-products were removed by gel filtration chromatography (Pharmacia Superdex 75 HR10/30 
column), using PBS as the buffer at a flow rate of 0.6 mUminute. An aliquot of the purified 
protein peak was measured in each case at A280nm and quantitated according to the E^ '/»280nm 

3Q values as described in Example 1 forthe respective proteins. Levels of radioactivity were 
determined by measurement in a Capintec Counter, set for reading 1-125. 

SDS-PAGE and lEF analyses were performed as previously described. Gels were 
dried and radioaaive bands detected by exposure to Kodak XAR X-OMAT film. The observed pi 
values forthe 8. IP species, as determined by lEF are 7.1 and 7.25 for monomer and dimer, 

35 respertively, and forthe 5.2 species, 4.9 and 5.0 for the monomer and dimer, respectively. 

Differences between the calculated values (after which the species are officially called) and the 
observed values indicate that the tertiary struaure of these molecules play a role in charge- 
charge interactions and/or hydrogen bonding which will determine the observed pi values. 
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Five test groups, each containing 5 female nu/nu (CD-I) mice (obtain d from 
Charles River Breeding Laboratories) bearing LS174Thuman tumor xenografts, were used to 
measure the biodistribution of '^^1 labeled antibody species. The biodistribution of the 
compounds was measured at various time intervals over 48 hours, after intravenous (tail vein) 
5 injection of the compounds. Animals were housed and identified by group as outlined in 
Section 4.0 of Standard Operation Manual for the Care of Laboratory Animals (50P-PTG). At 
the appropriate time, the mice were euthanatized and the radioactivity remaining in samples 
of blood, liver, spleen, kidney, tumor, lung, Gl tract, tail, and remaining carcass were quantifi d 
as outlined in Seaion 4.0 and Section 5.0 of SOP-PTG. The LS174T tumors were produced in the 
1 Q mice according to the procedure set forth in Section 2.0 of SOP-PTG. 

For metabolite studies, groups of Balb/c mice were injected intravenously with 
1 pCi of one of the antibody species. Mice from each group were anesthetized after 
30 minutes, 2h and 5h, urine removed from the bladder, urine counted for radioactivity and 
then kept frozen at -70'C until HPLC analysis. A Pharmacia Superdex 75 gel fiitration HR10/30 
^5 column was used for chromatography, using PBS as the eluent at 1.5 mL/minute. The volumes 
of the injeaed urine varied from 20-80 pL. The column eluent passed through a dual detertor 
for absorbence at 280nm and for radioactivity. 

Gamma Counter #2 (5earle/ND-66) was used to measure radioactivity in standards 
and tissue samples. Biodistribution of radioactivity in the colleaed tissues was analyzed using 
20 template TEMPL as outlined in Seaion 5.0 of SOP-PTG. Experimental outliers were treated as 
outlined in Section 6.0 of SOP-PTG. 

Table IV gives the amount of protein and activity of each species injected into the 
mice for biodistribution studies. 

Tables IV, V, VI, VII. VIII, and IX give the biodistribution of 125| CC49 species of 
25 scFv5.2, Fv2-5.2, and scFv8. 1 P, Fv2-8. 1 P, ChFab and IgG, respectively, based on percent injected 
dose per organ. Percent injected dose per organ is calculated by determining the '"I in counts 
per minute (CPM) in each tissue, divided by the injected dose (CPM) multiplied by 100 to give 
percent. 



35 
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The differences in pharmacokinetics for the various sizes and charges of the CC49 
antibody species are demonstrated by the data in taWes IV - IX. In general, the smaller the 
molecule, the faster it clears th blood. The 25-50 K dalton size molecules (all except the IgG) 
clear the blood within 24 hours. The negatively charged Fv2-5.2 dimer has the greatest amount 

5 in the blood at 2 hour (Table V). Both dimers accumulate in the tumor to the largest ertent 
over a 24 hour time period (Tables V and VII), excluding the IgG. 

A major difference between the antibody species is observed in retention by the 
kidneys. The negatively charged monomer species, CC49 scFv5.2, clears the most slowly 
(Table IV). This is surprising in that the negative charge should have prevented electrostatic 

^ 0 interaction with the negatively charged glomerular cells of the kidney. The nominally neutral , 
charged species, scFv8. IP cleared more quickly (Table Vl). This trend was also true for the two 
dimers, but clearance was much better for these relative to the monomers. It is unexpected 
that the negatively charged dimer would have the best tumor localization values (Table V). The 
data also shows that a relatively small molecule (58 Kd) has staying power on the tumor. Both 

^ 5 dimers out performed the monomers and chimeric CC49 Fab molecule in this regard. 

Results from metabolic studies indicated that after injeaion with 125| labeled scFv 
5.2 and Fv2-5.2, only one radioactive metabolite was observed in the urine, free 125|^ as 
determined by gel filtration chromatography. No larger peptides are observed at any of the 
time points. These data indicate that dehalogenation is occurring within the body and that 

20 'free 1-125 isabletobe readily excreted into the urine. 

The tumor to tissue ratios 8 to 24 hours post injection indicated the dimers would 
be effective in radioimmunoguided surgery procedures. Surgery may be performed in a more 
timely fashion, rather than 2 to 3 weeks post injection, as is presently necessitated by the 
biodistribution kinetics of whole antibody. 

Example 4 Construction, Purification and Characterization of Anti TAG-72 Single Chain 
Antibody Species with the Human Subgroup IV Vl 

To provide for a rapid and convenient procedure to determine the effectiveness 
of the single chain dimers and multimers of the present invention, a unique octapeptide 
sequence -NH2-Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys-COOH. called the FLAG"" peptide (a 
trademark of International Biotechnologies Inc., CT) was used. This sequence is recognized by 
International Biotechnology Incorporated's (IBI) mouse M2 monoclonal antibody. Placing the 
FLAG^ sequence on two annealing oligonucleotides in pSCFV UHH leads to a single chain 
antibody having the human subgroup 4 Vl and CC49 Vh with the FLAG peptide attached at the 
carboxyl terminus. 

The finoi plasmid derived from pSCFV UHH containing FLAG^" is designated 
pSC49FLAG. The nomenclature for the single chain proteins produced from pSC49FLAG is as 
follows: for the monomer-H4L49HF scFv, for the dimer - H4L49HF Fv2 and for the tetramer- 
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H4L49HF Fv4. The •H4L'' portion refers to the human Subgroup IV light chain V^^ regi n, 'ASH'* 
refers to the CC49 Vh and "F" indicates that the FLAC", peptide is attached. 
Construction of pSC49FLAG 

A general outline for obtaining the plasmid pSC49FLAG is shown in Figure 22 and 
5 the procedure described below. 

a) Isolation of pSCFVUHHXho //A/he / Vector Fraament 

Approximately 5 pg of plasmid pSCFV UHH (Example 1)in 1 5 pL of water was used 
from the MagicMini-prep system (Promega). To this was added 5.4pLof 1 0X Buffer #2 (New 
England Biolabs), 45 units of Xho /(New England Biolabs), 15 uniuof /S/he /and 24 pL of water. 
^0 The reaction proceeded for 1 hour at 37*C. The sample was then loaded with running dyes on a 
4 percent polyacrylamide gel, electrophoresed and purified by electroelution (Sambrook et al, 
supra). The obtained DNA pellet was dissolved in 20 pL of water. 

b) Construction of pATDFLAG 

Plasmid pSCFV UHH treated with Xho /and A/he /from above was used in a 
^5 ligation reaction with annealed FLAG and FLAGNC oligos. 
FLAGC: 

5'-TCGAGACAATGTCGCTAGCGACTACAAGGACGATGATGACAAATAAAAAC-3' 
FLAGNC: 

5'-CTAGGTTTTTATTTGTCATCATCGTCCTTGTAGTCGCTAGCGACATTGTC-3' 
20 Equimolar amounts (3 pM of each of the oligos) FLAGC and FLAGNC in a total 

volume of 62,5 pL) were mixed together using a ligation buffer diluted to IX (Stratagene). The 
sample was heated to 95*C for 2 min and was allowed to cool to below 35*C over 10 min before 
use in the ligation reaction below. 

2^ COMPONENT 

pSCFVUHH Xho I /N he I 
vector (-*100 ng) 

ANNEALED FLAGC/FLAGNC 

lOX Ligation buffer 

30 T4 DNA Ligase 

10 MM ATP 

ddH20 

The reaction was carried out using the components and amounts indicated above. 
35 Starting at 18*C, and then being cooled gradually to A^C overnight. Competent E co// Agl cells 
(Stratagene) were transformed with 3 pi of the above ligation reaction and colonies selected 
using LB/CAM 20 plates. A clone having the appropriate Nhe //Xho / and Nhe //Xho / restriction 
patterns was selected for DNA sequencing. 
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1.0 IIL 

0.85 pL 
2 iiL 

1 pL 

2 pL 
13.2 pL 
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The oligonucleotide used to verify the sequence of the FLAG^ linker in 
pATDFLAG was called PENPTSEQ: 5'-CTTTATGTAAGATGATGTTTTG-3. This oligonucleotide is 
derived from the non-coding strand of the pen? terminator region (See Figure 23). Figure 23 
gives the amino acid and DNA sequence in pATDFLAG. The amino acid sequence 
5 corresponding to the FLAG"* peptide is given in bold italics. The FLAG'" peptide is out of frame 
relative to the H4VL-linker sequence which ends at the Xho I site. The DNA sequencing pri mer, 
PENPTSEQ, is shown in bold. Nucleotides 824-478 inclusive were verified using the PENPTSEQ 
primer. At position 624, a 'T nucleotide present in the original H4V^ sequence has been 
changed to a 'C nucleotide resulting in a Val to Ala amino acid residue 89 substitution. These 
differences are indicated in shadow. The CDR amino acid sequences of H^VL are shown in bold, 
while the UNIHOPE linker sequence is in bold and underlined. DNA sequencing was performed 
using the Sequenase"" sequencing kit (U.S. Biochemical, Cleveland, OH) following the 
manufacturer's directions. 

c) Generation of DSC49FLAG 

The plasmid pATDFLAG (approximately 5 pg, purified from a 2.5 mL culture by the 
Magic Miniprep system (Promega) was treated at 37"Cfor 1 h as indicated in the protocol given 
below: 

DNA - 17iiL 

1 0X buffer (New England Biolabs #2) - 6 pL 
2Q Xho / (New England Biolabs) - 2.5 jiL (25 units) 

A/he / (New England Biolabs) - 3.5 pL (1 7.5 units) 
H2O - 31 iiL 

The resulting veaor fragment was purified from a 4 percent PAG, electroeluted and ethanoi 
precipitated. The obtained DNA was dissolved in 20 pL water. 

The CC49 Vh insert DNA was obtained from a PCR amplification protocol (Perkin 
Elmer Cetus) as instruaed by the manufaaurer. The target DNA (uncut) containing the CC49 
Vh was pSCFV UHM 8.1 (Example 1) (0.5 mg). The 5' oligo used for the PCR (100 pmol) was 
UNI-3 (Example 1). The 3' oligo used (100 pmol) was called SC49FLAG, a 33-mer as follows: 
Khe I 

5'-TATTGCTAGCTGA GGA GAC GGT GAC TGA GGT TC-3'. 

The resulting amplified DNA was purified from a 4 percent PAG, dissolved in 20 pL water, and 
then treated at 37°C for 1 h as indicated in the protocol given below: 
DNA- 10pL(-2pg) 

1 OX buffer (New England Biolabs #2) - 6 pL 
35 Xho / (New England Biolabs) - 2.5 pL (25 units) 

A/he /(New England Biolabs) - 3.5 pL (17.5 units) 
water - 38 pL 
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The resulting insert fragment was purified and dissolved in 20 water. The vector and insert 
fragments from above were set up in a ligation reaction at 1 8'C going down to 4»C overnight as 
follows: 

DNA vector (pATDFLAG-X/io l/Nhe /) - 1 .5 pL 
5 DNA insertCPCR amplified CC49VH-Xho//^he 0-0.8 iiL 

1 0X ligation buffer (Stratagene) -. 2 viL 
10mMATP-2iiL 
T4 DNA ligase - 1 pL 
water - 12.7 pL 

1 0 Competent E. co/i AG 1 cells (Stratagene) were transformed using the ligation reaction from 
above, as directed by the manufacturer. Clone number 1 . was selected for further work, and 
for verification of the construrted DNA sequence. The oiigos used as primers for the DNA 

sequencing were as follows: 

1) PENPR1,a21-mer: 5'-AACACTGTAGGG ATAGTGGAA-3'; 
,5 2) PENPR2.a20-mer: 5'-GTCTCCaCCGTTTG AATAT-B'; 

3) TAGVLFR2. a20.mer: 5'-TGG TAC GAG CAG AAACCA GG-3: 

4) UHVHSEQ,a22-mer: S'-GAT Ga GCG AAG AAG GAT GAC G-3'; 

5) TAGVHCDR2.a21-mer: 5'-ATGG ATTGGATATTTTTC^CC-3'and; 
6) TAG VH FR4, a 21-mer: 5'-ACT GGG GTC AAG GAA CCT CAG-3'. 

2Q The DNA and amino acid sequence of pSC49FLAG is given in Figure 24. Asp = 1 

indicates the first amino acid from H4V^ of the mature single chain antibody species. The 
UNIHOPE linker is in bold and underlined. The complementarity determining region sequences 
are shown in bold, while the FLAG peptide is in bold italics. Nucleotides corresponding to the 
Nhe I site in pSCFV UHH are in bold and underlined (position 1 1 54-1159). The 'C at position 
25 1155 and the 'A' at position 1 157 are italicized within this sequence and indicate differences 
from the published penP terminator sequence. The DNA sequencing was carried out using the 
Sequenase~ kit mentioned above. 

The predicted pi and molecular weight (MW) of each of the protein species 
produced was obtained from the DNASTAR" Protein-Titrate computer program (Madison. Wl). 
3Q Purification of H4L49HF Single Ch ain Products 

A 10 mL IVI2 antibody affinity column was prepared (gel purchased from IBI) and 
incorporated on the HPLC system described under general experimental procedures. The 
H4L49HF crude protein sample was prepared as follows. One liter (2 X 500 mL) of E. coli 
AG 1/pSC49FLAG was grown overnight at 37°C in LB broth containing 20 pg/mL 
35 chloramphenicol. The cells were recovered by centrif ugation of the culture at 5.000 rpm for 
10 min at 5'C in a Sorvall GS-3 rotor (duPont). The cell pellet was resuspended in 100 mL of PBS. 
and subjeaed to sonication (M5E Soniprep 150) for 5 bursts at 14 microns amplitude lasting for 
15 seconds each, and cooling in an ethanol-ice bath for 1 minute in between. The sonicate was 

-53- 



BNSDOCID: <WO _.B404691A2.I > 



wo 94/04691 



PCT/US93/07918 



ultracentrifuged at 26,000 rpm for 2 hours in a Type Ti30 rotor (Beckman InstrumenU). Th 
resulting supernatant was filtered through a 0.22 membrane device (Nalge Co.. Rochester. NY) 
and finally concentrated to 22 mL using Centriprep 30 (Amicon) devices. The sonicate was 
pumped onto the affinity column at 2 mL/min with stops lasting for 1 min twice during the 
5 application, to allow interaction with the affinity ligand. 

PBS was used to wash the column of any unbound material, and when the 
absorbance at 276 nm returned substantially to the baseline, the buffer was changed to 0.1 M 
glycine-HCI.atpH3.1. Elutionof a peak began 4 minutes later, the fractions of which were 
immediately neutrali2ed using 1 M Tris pH 8.8 and PANPEHA indicator (range 0-14) pH paper 
,0 stri ps (Schleicher Schull). The eluted protein vvas concentrated to 200 pL using Centriprep 30 
and Centricon 30 (Amicon) devices. 

The resulting single chain antibody species were separated by gel filtration using 
a Pharmacia Superdex 75 Prep Grade HR 26/60 column and PBS as the eluent. The flow rate was 
initially 0.4 mL/min, and then increased to 1 .2 miymin during the chromatography. Three pools 
,5 of frartions from this run were made, and then tested analytically on a Superdex 75 HR10/30 
column for purity and characteristic elution time. Biorad gel filtration standards were also 
chromatographed on this column using the same buffer (PBS) and a 0.6 mL/min flow rate, to 
generate a standard curve for comparison with the elution times of the single chain antib dy 
species. The three species obtained were concentrated, quantitated at OD 280 nm and stored 
20 frozen at ^20''C. 

The results of the purification and characterization of the H4L49HF single chain 
products is given in Figure 25. Panel A shows the M2 affinity chromatography profile for £. col, 
PSC49FLAG sonicate preparation. Stars (*) indicate points in the application of sample when 
the flow was stopped for 30 seconds. The arrow indicates the point at which the buffer was 
25 changed and the flow chart increased to 2.0 mUmin. Fraaions were 2 minutes long. 

Panel B represents an analytical gel filtration chromatogram obtained from 
application of 0.5 percent of the total pooled sample from fraaions 19-21 shown in A. 

Panel C shows a 1 0-20 percent gradient SDS-PAGE of samples stained with 
Coomassie Brilliant Blue R-250 (Lanes 1-6) or Western transferred (lanes 7-1 1) and probed with 
30 biotinylated FAID14. Lane 1 : prestained molecular weight standards (Diversified Biotech); • 
lanes 2-4 and 7-9 are purified H4L49HF scFv, Fv2 and Fv4. respeaively; lanes 5 and 10; E. co/# 
PSC49FLAG sonicate sample, the same as that as applied to M2 affinity column; and lanes 6 and 
1 1 . Pichia CC49scFv 5.2. 
Amino Terminal Seouencino 

Samples of the three protein species of H4L49HF (10 pg each) were desalted by 3 
successive water washings using Centricon 30 devices and freeze-dried. Amino terminal 
sequencing was determin d by standard Edman degradation techniques. 
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Determination of Anti TAG-72 Activity 
a) anti-FLAGELISA 

A. Tag Elisa 

Binding activities of the purified monomer and multimer forms of H4L49HF as 
5 well as sonicates of f . co// expressing these species were determined by ELISA (Enzyme Linked 
Immunosorbant Assay) on 96 well PVC plates coated with a 1/300 dilution of the TAG-72 prep 
(040191). The plates were prepared and blocked for two hours with 1 percent bovine serum 
albumin (BSA) in PBS with 0.025 percent sodium azide at 37**C After blocking, the plates were 
washed three times with 0.025 percent Tween 20 solution and stored at 4*'C. 
^ Q Aliquots of 50 pL of sample at each concentration were added in duplicate to the 

96 well plate. The starting concentrations for the purified monomer, dimer, and tetramer were 
lOpg/mL for each, and 1:2 dilutions were made to a final concentration of 0.005 pg/mL. The 
samples added to the plate included the purified monomer, dimer, tetramer, and various crude 
sonicates. 

^ 5 After adding the samples, the plate went through a series of incubations at 37*C. 

The first incubation was for 1 hour with sample. The plate was washed five times with 
0.025 percent Tween 20. Binding of FLAG™ containing single chain species was detected by 
subsequent incubations with murine anti-FLAG MAb, M2 (International Biotechnologies 
cat: IB 13010) at 1/250 dilution. The plate was incubated for 1 hour removed, and washed. 

20 Alkaline phosphatase conjugated goat anti-mouse polyclonal antibody (GAMIG-ALPH, 

Southern Biotechnology Associates) was added at 50 pL per well. The plate was incubated for 
1 hour, removed, and washed. The diluent for both antibodies was in 1 percent BSA in PBS with 
0.025 percent sodium azide. The phosphatase substrate (Kirkegaard & Perry Laboratories, Inc.) 
was prepared by mixing 1 ml of 5X buffer (Kirkegaard & Perry Kit), 4 mL water, and one 

25 substrate tablet (Kirkegaard & Perry Kit). To each well, 50 pL were added. Color development 
was terminated after 1 5 minutes with 50 pL/well of 1 M sodium hydroxide. The ELISA plate was 
read with the Molecular Devices Kinetic Microplate Reader using the Delta Soft® Computer 
program. 

B. BSM ELISA 

Bovine submaxillary mucin (BSM) was applied to a PVC plate at concentrations 
ranging from 5.0 mg/ml to 0.00008 mg/mi decreasing at 1 :2 dilutions in water. A blank of 50 pL 
of PBS with 0.025 percent azide was added as a control. The plate was allowed to dry at room 
temperature overnight in a biological safety cabinet. The plate was blocked using 200 pL of 
1 percent BSA in PBS. The plate was incubated at 37*C for 50 minutes. The plate was removed 
from incubation and washed three times with 0.025 percent Tween 20. Dilutions (1/3, 1/9, 1/27) 
of a crude sonicate of pSC49FLAG were applied to the BSM control wells. 

After obtaining results from the above ELISA, a BSM concentration of 60 pg/ml 
was selected for further plating. The diluent was water. The BSM at 60 pg/ml was loaded at 
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50}iL per well using a multi-channel pipet and dried and blocked as above. The plates were 
washed three times with 0.025 percent Tween 20 and stored at 4**C. Incubation and detection 
of FLAG-containing species was performed as above, using the M2 anti-FLAG antibody. 

The relative binding of H4L49HF single chain species by TAG of BSM ELISA are 
5 given in Figure 26. Figure 26A shov« results from TAG ELISA for purified monomer, dimer and 
tetramer. Figure 263 shows results from TAG ELISA for E. coH pSC49FLAG sonicate containing 
monomer and multimer species. Figure 26C shows results from various concentrations of the 
single chain species reacting with BSM. Figure 26D is ELISA on BSM with pSC49FLAG sonicate. 

In Figure 26A, starting with concentrations of lOug/ml, 1:3 dilutions were 
performed and the relative binding to TAG-coated plates was determined. It can be seen that 
the tetramer species gave the greatest binding, followed by dimer, and eventually by 
monomer. Using DeltaSoft software (BioMetallics, Princeton, NJ), and assuming the dimer as 
the standard curve, the tetramer activity was 1 .3 to 5 times greater than the dimer activity. The 
monomer activity was only 0.5 to 0.8 percent of the dimer activity. 
^5 b) Competition ELISA 

This assay was performed as described in Example 1. The formula used to 
calculate percent competi'don is as follows: 

No Competition - Sample reading j^qq 
No Competition - Background 
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Absorbance values for the samples were measured at 405 nm -450 nm. The average of 
duplicate readings was used. Initially samples (25 pL) were applied to the TAG-72 coated 
microliter plates at 2.0 X 10 'VmL. Biotin CC49 (4 pg/pL diluted 1:20,000- used 25 pL) diluted 
the samples by a factor of 2. Serial dilutions (1 :2) were performed, resulting in a total of 10 
concentrations for each sample. The *no competition" sample consisted of 25 pL of antibody 
buffer (1 percent BSA in Tris buffered saline (Pierce Chemical Co.) + 0.025 percent Tween and 
25 pL of the biotinylated CC49 competitor. Background samples were formulated with either 
antibody buffer with no added CC49 biotin, or a 5 pg/mL solution of CC49 IgG in antibody 
buffer. Both give essent. ally the background reading. 

The results of the competition ELISA are given i n Figures 27 and 28, Figure 27 
based on moles of antibody binding site and Figure 28 based on moles of antibody. These 
results show that the extra FLAG ~ binding sites on the dimer and tetramer did not skew the 
results in favor of the multimers. 
Immunohistochemistn/ (IHC) 

Specimens. A total of 7 specimens of colorectal adenocarcinoma, including four 
primary adenocarcinomas and three intra-abdominal metastases (two mesenteric metastases 
and one hepatic metastasis), were used in these studies. Areas of reactive or normal tissue were 
also present for assessment of immunoreaaivity of non-neopiastic tissues. Sequential 5 pm 
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10 



thick sections were prepared. One section was stained with hematoxylin and eosin (H and E) 
with other sections used for the various IHC procedures. 

The purified H4L49HF Fv2 product was used at 0.33, 1 .7 and 8.3 jig/mL levels as 
the primary probing agent forthe specimens. Sonication (as per Example 1) of a 2.5 mL 
overnight culture of £. co// pSC49FLAG provided a crude test probe material and was used for 
the specimens. These sonicates contained a mixture of H4L49HF scev,|v2, Fv4 etc. whose 
concentration and composition was not known. 

I m m u noh i stoc he m 1 strv p roced u res ( I H C) . IHC was performed using a 
modification of the avidin-biotin-peroxidase complex (ABQ method (Hsu etal., / Histochem. 
Cytochem,, 29, 577-580, 1981). The method utilized a commercial anti-mouse IgG kit 
(Vectastain Peroxidase Mouse IgG, Vector Laboratories, Burlingame, CA). Briefly, the technique 
was as follows: 

1) Sequential 5-6 p sections of the test tumors were deparaffinized, rehydrated, 
and endogenous peroxidase artivity was quenched. 

15 The sections were then sequentially flooded with the following reagents: 

2) Normal equine blocking serum (Veaastain kit) for 20 minutes. 

3) Add primary antibody for 45 minutes, either 200 pL of H4L49HF Fv2 or sonicate 
of £. co/< p5C49FLAG (from 2.5 mL of overnight cell growth), and add to sequential sections of 
the tumor. Diluent (PBS containing 0.1 percent BSA, Sigma Chemicals Co. St. Louis, MO) was 

20 used on one section from every tumor at this step to serve as a negative control. 

4) M-2 (International Biotechnologies Inc., San Pedro, CA), murine anti-FLAG 
MAb, was then used at a dilution of 1 : 1 00 for 45 minutes. 

5) Secondary antibody, biotinylated anti-mouse IgG (Vectastain kit), for 

45 minutes. 

25 6) ABC reagent for 45 minutes. 

7) The chromagen, 0.05 percent diaminobenzidine. in 0.02 percent H2O2 for 

3.5 minutes. 

8) The slides were counterstained with Meyer's hematoxylin (Sigma Diagnostics, 
St. Louis, MO), dehydrated, and coverslipped by routine methods. 

3Q All reactions were conduaed at room temperature. The slides were rinsed between each step, 
except following step 2, in three washes of PBS for at least five minutes each; the final rinse 
contained 0. 1 percent BSA. IHC reactivity was evaluated for both the acelluiar and cellular 
parts of the tumor as well as for adjacent non-neoplastic struaures 

The H4L49HF purified dimer stained both cellular and acelluiar areas of the test 

35 colon carcinomas. Mucinous acelluiar material from the neoplastic cells was extensively 
reactive. Viable neoplastic cells had more variable reactivity with some tumors having high 
numbers of reactive cells and other tumors having few reartive cells. The specific cellular 
location of the staining was also variable within and between the different specimens. The 
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atypical cytoplasm and/or (atypical) surface adjacent stained aceliular material was most 
frequently stained. Lesser members of cells had diffuse cytoplasmic staining. There was little 
reactivity of H4L49H Fv2 with limited numbers of non-neoplasmic tissues examined except 
reactive colon mucosa immediately adjacent CRC In this mucosa, the mucin vacuoles of goblet 
5 cells and the free mucin onthe surface was reactive. The staining diminished with the distance 
from the tumor/reactive mucosa junction. Crude E. coli pSC49FLAG sonicate, containing all the 
H4L49HFscFv species (monomer, dimer, multimer, etc.) also gave similar results, indicating that 
these single chain antibodies are specific for tumor tissues. A control sonicate from f. coli 
pATDFLAG gave no staining of the TAG-72 positive areas. 
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Claims : 

1. A protein comprising a dimer or multimer of single chain polypeptides 
wherein the polypeptides are non-covalently linked and each polypeptide comprises 

(a) a first polypeptide comprising an antigen binding portion of a variable 
5 domain of an antibody; 

(b) a second polypeptide comprising an antigen binding portion of a variable 
domain of an antibody; and 

(c) a peptide linker linking the first and second polypeptides (a) and (b) into a 
single chain polypeptide having affinity for an antigen. 

2. The protein of Claim 1 wherein the first polypeptide is an antigen binding 
portion of a heavy chain variable region of an antibody and the second polypeptide is an 
antigen binding portion of a light chain variable region of an antibody. 

3. The protein of Claim 1 wherein the first polypeptide is an antigen binding 
portion of a light chain variable region of an antibody and the second polypeptide is an 

^ 5 antigen binding portion of a heavy chain variable region of an antibody. 

4. The protein of Claim 3 wherein the polypeptide is a dimer. 

5. The protein of any one of Claims 1 to 4 conjugated to an imaging marker. 

6. The protein of Claim 5 wherein the imaging marker is ^251, i3i | j nin, ^osRh, 
i53Sm, 67Cu, 67Ga. ^66Ho. 177LU. i86Re, i88Re or 99mJc, 

20 7. Acompositioncomprisingaproteinof any one of Claims 1 to 6 in a 

pharmaceutical ly acceptable carrier. 

8- The protein of any one of Claims 1 to 4 conjugated to a therapeutic agent. 
9. The protein of Claim 8 wherein the therapeutic agent is a radionuclide, 
drug or biological response modifier or toxin. 
25 10. The composition of Claim 9 wherein the radionuclide is i^M, soy, lOSRh, 

^7Sc, 67cu,2i2Bi, 2iiAt, 67Ga, 1251, i86Re, i88Re, i77Lu,99mTc, i53Sm, i23|oriinn. 

11. A composition of any one of Claims 8 to 10 in a pharmaceutical !y 
acceptable carrier. 

12. A method of imaging the internal structure of an animal comprising 

30 administering to the animal an effective amount of a composition of Claim 6 and measuring 
detectable radiation associated with the animal. 

13. A method for producing a dimer or multimer of single chain polypeptides 
wherein the polypeptides are non-covalently linked and each polypeptide comprises (a) a first 
polypeptide comprising an antigen binding portion of a variable domain of an antibody; (b) a 

35 second polypeptide comprising an antigen binding portion of a variable domain of an 

antibody; and (c) a peptide linker linking the first and second polypeptides (a) and (b) into a 
single chain polypeptide having affinity of an antigen, the method comprising: 
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(i) providing a genetic sequence coding for a 
single chain polypeptide; 

(ii) transforming a host cell with the genetic 
sequence; 

(iii) expressing the genetic sequence in the 
host cell; and 

(iv) recovery of single chain polypeptides 
which have non-covalently linked to form 
dimers and multimers. 

14. The method of Claim 13 wherein genetic sequence coding for a single 
chain polypeptide contains a sequence encoding for a signal peptide which will direct secretion 
of the single chain polypeptide out of the cytoplasm of the host cell. 

1 5. The protein of any one of Claims 1-4 wherein the linker is encoded by a 
nucleotide sequence of 

5'-CTTAGTGCGGACGATGCGAAAAAGGATGCTGCGAAGAAGGATGACGCTAAG 
AAAG ACG ATG CTAAAAAGG ACCTC-3' . 

16. The proteinof any of Claims 1-4 wherein the linker has the amino acid 
sequence Leu Ser Ala Asp Asp Ala Lys Lys Asp Ala Ala Lys Lys Asp Asp Ala Lys Lys Asp Asp Ala Lys 
Lys Asp Leu. 

17. A method of detecting an antigen in orsuspeaed of being in a sample 
which comprises: 

(a) contacting the sample with the protein of any one of Claims 1 to 4; and 

(b) detecting whether the protein has bound to the antigen. 
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FIGURE 1 



CONSTRUCTION OF PLASMID pSCFV31 
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FIGURE 2 

CONSTRUCTION OF PLASMID pSCFV31 
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FIGURE 3 
Met Leu 

AAAAACTAT AAGCTCCATG ATG CTT 

Leu Gin Ala Phe Leu Phe Leu Leu Ala 
TTG CAA GCT TTC CTT TTC CTT TTG GCT 



Gly Phe Ala Ala Lys He Ser Ala Asp 
GGT TTT GCA GCC AAA ATA TCT GCA GAC 



He Val Met Thr Gin Ser Pro Asp Ser 
ATG GTG ATG ACC CAG TCT CCA GAC TCC 



Leu Ala Val Ser Leu Gly Glu Arg Ala 
CTG GCT GTG TCT CTG GGC GAG AGG GCC 

: CDRIL — 

Thr He Asn Cys Lys Ser Ser Gin Ser 
ACC ATC AAC TGC AAG TCC AGC TGC AAG 



Val Leu Tyr Ser Ser Asn Asn Lys Asn 
GTT TTA TAC AGC TCC AAC AAT AAG AAC 



Tyr Leu Ala Trp Tyr Gin Gin Lys Pro 
TAC TTA GCT TGG TAC CAG CAG AAA CCA 



Glv Gin Pro Pro Lys Leu Leu He Tyr 
GGA CAG CCT CCT AAG CTG CTC ATT TAC 



: CDR2L : 

Trp Ala Ser Thr Arg Glu Ser Gly Val 
TGG GCA TCT ACC CGG GAA TCC GGG GTC 



Pro AsD Arg Phe Ser Gly Ser Gly Ser 
CCT GAC CGA TTC AGT GGC AGC GGG TCT 



Gly Thr Asp Phe Thr Leu Thr He Ser 
GGG ACA GAT TTC iCT CTC ACC ATC AGC 



Ser Leu Gin Ala Glu Asd Va: Ala Val 
AGC CTG CAG GCT GAA GAT GTG GCA GT ' 
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FIGURE 3 Continued 



J CDR3L 

Tyr Tyr Cys Gin Gin Tyr Tyr Ser Tyr 
TAT TAG TGT GAG CAA TAT TAT AGT TAT 



Pro Leu Thr Phe Gly Gly Gly Thr Lys 
COT CTC ACT TTC GGC GGA GGG ACC AAG 



Val Lys Glu Ser Gly Ser Val Ser Ser 
GTG AAG GAG TCA GGT TCG GTC TCC TCA 



LINKER 

Glu Gin Leu Ala Gin Phe Arg Ser Leu 
GAA CAA TTG GCC CAA TTT CGT TCC TTA 



Asp Val Gin Leu Gin Gin Ser Asp Ala 
GAC GTC CAG TTG CAG CAG TCT GAC GCT 



Glu Leu Val Lys Pro Gly Ala Ser Val 
GAG TTG GTG AAA CCT GGG GCT TCA GTG 



Lys lie Ser Cys Lys Ala Ser Gly Tyr 
AAG ATT TCC TGC AAG GCT TCT GGC TAG 



: CDRIH : 

Thr Phe Thr Asp His Ala He His Trp 
ACC TTC ACT GAC CAT GCA ATT CAC TGG 



Val Lys Gin Asn Pro Glu Gin Gly Leu 
GTG AAA CAG AAC CCT GAA CAG GGC CTG 



Glu Trp He Gly Tyr Phe Ser Pro Gly 
GAA TGG ATT GGA TAT TTT TCT CCC GGA 



CDR2H 

Asn Asp Asp Phe Lys Tyr Asn Glu Arg 
AAT GAT GAT TTT AAA TAC AAT GAG AGG 



Phe Lys Gly Lys Ala Thr Leu Thr Ala 
TTC AAG GGC AAG GCC ACA CTG ACT GCA 
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FIGURE 3 Continued 



Asp Lys Ser Ser Ser Thr Ala Tyr Val 
GAC AAA TCC TCC AGC ACT GCC TAC GTG 



Gin Leu Asn Ser Leu Thr Ser Glu Asp 
GAG CTC AAC AGC CTG ACA TOT GAG GAT 



Ser Ala Val Tyr Phe Cys Thr Arg Ser 
TCT GCA GTG TAT TTC TGT ACA AGA TCC 



CDR3H : 

Leu Asn Met Ala Tyr Trp Gly Gin Gly 
CTG AAT ATG GCC TAC TGG GGT CAA GGA 



thr Ser Val Thr Val Ser 

ACC TCA GTC ACC GTC TCC TAG TGA 



AGCTTGGAAC ACCACACAAA CCATATCCAA A 
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FIGURE 5 



CXa I 

3' ATCGAT AAAA TTTATTGAGA ATTTGTTTAT TATGATTAAC AGAGGTAAAA GCCAGTATAT 34 38 
3 • TAGCTATTTT AAATAACTCT TAAACAAATA ATACTAATTG TCTCCATTTT CGGTCATATA 



TACTGATTAA TATAGGTAAA AGGCAGTTAA GAAATTGGGA ATGCTTTCTC TTCTGCTTTC 34 9 8 

ATGACTAATT ATATCCATTT TCCGTCAATT CTTTAACCCT TACGAAAGAG AAGACGAAAG 

TTCTACGATG CACAAGGCGT TTCACATTTA TGCCOCTATG AAAATTACTA GGCTGTCCTA 3558 
AAGATGCTAC GTGTTCCGCA AAGTGTAAAT ACGGGGATAC TTTTAATGAT CCGACAGGAT 

GTCATTAGAT CTTTCAGCAG TTTGTAGTTT TAGAGCTTCT AAGTTGACTT CTGTCTTTTC 3618 
CAGTAATCTA GAAAGTCGTC AAACATCAAA ATCTCGAAGA TTCAACTGAA GACAGAAAAG 

TATTCATACA ATTACACATT CTGTGATGAT ATTTTTGGCT CTTGATTTAC ATTGGGTACT 3678 
ATAAGTATGT TAATGTGTAA GACACTACTA TAAAAACCGA GAACTAAATG TAACCCATGA 
HUMLZNl ( - ) 

TTCACAACCC ACTGCTCATG AAATTTGCTT TTGTACTACT GGTTGTTTTT GCATAGGCCC 3738 
AAGTGTTGGG TGACGAGTAC TTTAAACGAA AACATGATGA CCAACAAAAA CGTATCCGGG 

CTCCAGGCCA CGACCAGGTG TTTGGATTTT ATAAACGGGC CGTTTGCATT GTGAACTGAG 3798 
GAGGTCCGGT GCTGGTCCAC AAACCTAAAA TATTTGCCCG GCAAACGTAA CACTTGACTC 

Met Val Leu Gin Thr Gin Val Leu lie 
CTACAACAGG CAGGCAGGGG CAGCAAG ATG GTG TTG CAG ACC CAG GTC TTC ATT 3852 
GATGTTGTCC GTCCGTCCCC GTCGTTC TAC CAC AAC GTC TGG GTC CAG AAG TAA 

Ser Leu Leu Leu Trp lie Ser G:Intron 

TCT CTG TTG CTC TGG ATC TCT G GTGA GGAATTAAAA AGTGCCACAG TCTTTTCAGA 3908 
AGA GAC AAC GAG ACC TAG AGA C CACT CCTTAATTTT TCACGGTGTC AGAAAAGTCT 

HUMLIN2(-) 

GTAATATCTG TGTAGAAATA AAAAAAATTA AGATATAGTT GGAAATAATG ACTATTTCCA 3968 
CATTATAGAC ACATCTTTAT TTTTTTTAAT TCTATATCAA CCTTTATTAC TGATAAAGGT 

Bam HI 

ATAT GGATCC AATTATCTGC TGACTTATAA TACTACTAGA AAGCAAATTT AAATGACATA 4028 
TATACCTAGG TTAATAGACG ACTGAATATT ATGATGATCT TTCGTTTAAA TTTACTGTAT 
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FIGURE 5 Continued 



TTTCAATTAT ATCTGAGACA GCGTGTATAA GTTTATGTAT AATCATTGTC CATTACTGAC 4088 
AAAGTTAATA TAGACTCTGT CGCACATATT CAAATACATA TTAGTAACAG GTAATGACTG 

+1 

ly Ala Tyr Gly Asp lie Val Met Thr Gin Ser Pro Asp Ser Leu 
TAG AG GT GCC TAG GGG GAC ATC GTG ATG ACC GAG TCT CCA GAC TCC CTG 4137 
ATGTC CA CGG ATG CCC CTG TAG CAC TAC TGG GTC AGA GGT CTG AGG GAC 



Ala Val Ser Leu Gly Glu Arg Ala Thr lie Asn Cys Lys Ser Ser Gin 

GCT GTG TCT CTG GGC GAG AGG GCC ACC ATC AAC TGC AAG TCC AGC CAG 4185 

CGA CAC AGA GAC CCG CTC TCC CGG TGG TAG TTG ACG TTC AGG TCG GTC 

Ser Val Leu Tyr Ser Ser Asn Asn Lys Asn Tyr Leu Ala Trp Tyr Gin 
AGT GTT TTA TAC AGC TCC AAC AAT AAG AAC TAC TTA GCT TGG TAC CAG 4233 
TCA CAA AAT ATG TCG AGG TTG TTA TTC TTG ATG AAT CGA ACC ATG GTC 

HUMLCDRl { - ) 

. CDR 2 

Gin Lys Pro Gly Gin Pro Pro Lys Leu Leu lie Tyr Trp Ala Ser Thr 

CAG AAA CCA GGA CAG CCT CCT AAG CTG CTC ATT TAC TGG GCA TCT ACC 4281 

GTC TTT GGT CCT GTC GGA GGA TTC GAC GAG TAA ATG ACC CGT AGA TGG 



Arg Glu Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr 

CGG GAA TCC GGG GTC CCT GAC CGA TTC AGT GGC AGC GGG TCT GGG AC A 4 329 

GCC CTT AGG CCC CAG GGA CTG GCT AAG TCA CCG TCG CCC AGA CCC TGT 

Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Ala Glu Asp Val Ala Val 

GAT TTC ACT CTC ACC ATC AGC AGC CTG CAG GCT GAA GAT GTG GCA GTT 4 37 7 

CTA AAG TGA GAG TGG TAG TCG TCG GAC GTC CGA CTT CTA CAC CGT CAA 

: CDR 3 : 

Tyr Tyr Cys Gin Gin Tyr Tyr Ser Tyr Pro Leu Thr Phe Gly Gly Gly 

TAT TAC TGT CAG CAA TAT TAT AGT TAT CCT CTC ACT TTC GGC GGA GGG 4 4 25 

ATA ATG ACA GTC GTT ATA ATA TCA ATA GGA GAG TGA AAG CCG CCT CCC 

Thr Lys Val Val He Lys A Hind III 

ACC AAG GTG GTG ATC AAA C GTAAGTACAC TTTTCT AAG CTT 3' human Ctc 

4466 

TGG TTC CAC CAC TAG TTT G CATTCATGTG AAAAGATTC GAA 5' insert 
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FIGURE 7 

DNA AND AMINO ACID SEQUENCES OF H4Vi-CC49 V. SCFV 



Cl* 1 SCOK Z 



5'-C 


TCA 


TGT 


TTG 


ACA 


GCT 


TAT 


CAT 


CGA 


TGA 


ATT 


CCA 


TCA 


CTT 


CCC 


TCC 


46 


GTT 


CAT 


TTG 


TCC 


CCG 


GTG 


GAA 


ACG 


AGG 


TCA 


TCA 


TTT 


CCT 


TCC 


GAA 


AAA 


94 


ACG 


GTT 


GCA 


TTT 


AAA 


TCT 


TAC 


ATA 


TAT 


AAT 


ACT 


TTC 


AAA 


GAC 


TAC 


ATT 


142 


TGT 


AAG 


ATT 


TGA 


TGT 


TTG 


AGT 


CGG 


CTG 


AAA 


GAT 


CGT 


ACG 


TAC 


CAA 


TTA 


190 


TTG 


TTT 


CGT 


GAT 


TGT 


TCA 


AGC 


CAT 


AAG 


ACT 


GTA 


GGG 


ATA 


GTG 


GAA 


AGA 


238 


GTG 


CTT CAT 


CTG 


GTT 


ACG 


ATC 


AAT 


CAA 


ATA 


TTC 


AAA 


CGG 


AGG 


GAG 


ACG 


286 


ATT 


TTG 


-22 
Met 
ATG 


Lys 
AAA 


Tyr 
TAC 


Leu 
CTA 


Leu 
TTG 


Pro 
CCT 


Thr 
ACG 


Ala 
GCA 


Ala 
GCC 


Ala 
GCT 


Gly 
GGA 


Leu 
TTG 


Leu 
TTA 


Leu 
TTA 


334 


Leu 
CTC 


Ala 
GCT 


Ala 
GCC 


Gin 
CAA 


Pro 
CCA 


Kco X 

Ala Met 
GCC ATG 


Ala 
GCC 


+ 1 
Asp 

GAC 


lie 
ATC 


Val 
GTG 


Met 
ATG 


Thr 
ACC 


Gin 
CAG 


Ser 
TCT 


Pro 
CCA 


382 


Asp 
GAC 


10 

Ser 
TCC 


Leu 
CTG 


Ala 
GCT 


Val 

GTG 


Ser 
TCT 


Leu 
CTG 


Gly 
GGC 


Glu 
GAG 


Arg 
AGG 


Ala 

GCC 


20 
Thr 
ACC 


He 
ATC 


Asn 
AAC 


Cys 
TGC 


Lys 

AAG 


430 


Ser 

TCC 


Ser 

AGC 


Gin 

CAG 


Ser 

AGT 


Val 

GTT 


30 
Leu 
TTA 


Tyr 

TAC 


ser 

AGC 


Ser 

TCC 


Asn 

AAC 


Asn 

AAT 


Lys 

AAG 


Asn 

AAC 


Tyr 

-TAC 


X^eu 
TTA 


40 

AX A 

GCT 


.4.78 


Trp 

TGG 


Tyr 
TAC 


Gin 
CAG 


Gin 
CAG 


Lys 
AAA 


Pro 
CCA 


Gly 
GGA 


Gin 
CAG 


Pro 
CCT 


50 
Pro 
CCT 


Lys 
AAG 


Leu 
CTG 


Leu 
CTC 


He 
ATT 


Tyr 
TAC 


Trp 

TGG 


526 


Ala 

GCA 


Ser 

TCT 


Tbr 

ACC 


60 
Arg 

CGG 


Glu 

GAA 


Ser 

TCC 


Gly 

GGG 


Val 
GTC 


Pro 
CCT 


Asp 
GAC 


Arg 
CGA 


Phe 
TTC 


Ser 
AGT 


"70 
Gly 
GGC 


Ser 
AGC 


Gly 
GGG 


574 


Ser 
TCT 


Gly 
GGG 


Thr 
ACA 


Asp 
GAT 


Phe 
TTC 


Thr 
ACT 


Leu 
CTC 


80 
Thr 
ACC 


lie 
ATC 


Ser 
AGC 


Ser 
AGC 


Leu 
CTG 


Gin 
CAG 


Ala 
GCT 


Glu 
GAA 


Asp 
GAT 


622 


Val 
GTG 


90 
Ala 
GCA 


Val 
GTT 


Tyr 
TAT 


Tyr 
TAC 


Cys 
TGT 


Gin 

CAG 


Gin 

CAA 


Tyr 

TAT 


Tyr 

TAT 


Ser 

AGT 


100 
Tyr 

TAT 


Pro 

CCT 


Leu 

CTC 


Thr 

ACT 


Phe 
TTC 


670 


Gly 
GGC 


Gly 
GGA 


Gly 

GGG 


Thr 


Lys 
AAG 


110 
Val 


Val 


lie 


Bind III 
Lvs Leu Ser 


Ala 


Asp 


Asp 


Ala 


120 
Lvs 




ACC 


GTG 


GTG 


ATC 


AAG 


CTT 


AGT 


GCG 


GAC 


GAT 


GCG 


AAA 


718 
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FIGURE 7 Continued 
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Asp Ala 


Ala 


Lys 


Lys 


Asp Asp Ala 


Lvs 


Lys 


Asp Asp Ala 


Lys 


Lvs 




AAG 


GAT 


GCT 


GCG 


AAG 


AAG 


GAT 


GAC 


GCT 


AAG 


AAA 


GAC 


GAT 


GCT 


AAA 


AAG 


766 


Asp 

GAC 


Xho 
Leu 

CTC 


Z 

Glu 
GAG 


140 
Val 
GTT 


Gin 
CAG 


Leu 
TTG 


Gin 
CAG 


Gin 
CAG 


Ser Asp Ala 
TCT GAC GCT 


u ^ u 

GAG 


T .o\) 
u 

TTG 


150 
Val 
GTG 


Lys 
AAA 


Pro 
CCT 


814 


Gly 

GGG 


Ala 

GCT 


Ser 
TCA 


Val 
GTG 


Lys 
AAG 


He 
ATT 


TCC 


160 

P v c 
TGC 


Lys Alai 
AAG GCT 


Ser 
TCT 


o J. y 
GGC 


Tyr 
TAC 


Thr 
ACC 


Phe 
TTC 


Thr 

APT 


ft fi? 

V D £ 


Asp 
GAC 


170 
Bis 
CAT 


Ala 

GCA 


He 

ATT 


His 

PAP 


Trp 

1 oo 


va i. 
GTG 


Ly s 
AAA 


Gin 
CAG 


Asn 
AAC 


Pro 
CCT 


180 
GAA 


Gin 
CAG 


Gly 
GGC 


Leu 
CTG 


Glu 


Q1 (\ 
7l U 


Trp 

TGG 


He 

ATT 


Gly Tyr 

GGA TAT 


Phe 

TTT 


190 
Ser 

TPT 


ccc 


GGA 


Asn 

AAT 


Asp Asp 

GAT GAT 


Fne 

TTT 


Lys 

AAA 


Tyr Asn 

TAC AAT 


200 
Glu 


Q ^ fi 


Arg Phe 

AGG TTC 


Lys 

AAG 


Gly 

GGC 


Lys 


Ala 


Thr 
ACA 


Leu 
CTG 


Thr 
ACT 


210 
Ala 
GCA 


Asp 
GAC 


Lys 
AAA 


Ser 
TCC 


Ser 
TCC 


Ser 
AGC 


Thr 
ACT 


IQOd 


Ala 
GCC 


Tyr 
TAC 


Val 
GTG 


220 
Gin 
CAG 


Leu 
CTC 


Asn 
AAC 


Ser 
AGC 


Leu 
CTG 


Thr 
ACA 


Ser 
TCT 


Glu 
GAG 


Asp 
GAT 


Ser 
TCT 


230 
Ala 
GCA 


Val 
GTG 


Tyr 
TAT 


1054 


Phe 
TTC 


Cys 
TGT 


Thr 
ACA 


Arg 
AGA 


Ser 

TCC 


Leu 

CTG 


Asn 

AAT 


240 
Met 

ATG 


Ala 

GCC 


Tyr 

TAC 


Trp 

TGG 


Gly 

GGT 


Gin 
CAA 


Gly 
GGA 


Thr 
ACC 


Ser 
TCA 


1102 


Val 

GTC 


250 
Thr 

ACC 


Val 
GTC 


Ser 
TCC 


Ser 
TCA 


* * * 

TAA 


AAA 


Nhe 

GCT 


I 

AGC 


GAT 


GAA 


TCC 


GTC 


AAA 


ACA 


TCA 


1150 


TCT 


TAC 


ATA 




TCA 


CTT 


GGT 


GAT 


CAA 


GCT 


CAT 


ATC 


ATT 


GTC 


CGG 


CAA 


1198 


TGG 


TGT 


GGG 


CTT 


TTT 


TTG 


TTT 


TCT 


ATC 


TTT 


AAA 


GAT 


CAT 


GTG 


AAG 


AAA 


1246 


AAC 


GGG 


AAA 


ATC 


GGT 


CTG 


CGG 


GAA 


AGG 


ACC 


GGG 


TTT 


TTG 


TCG 


AAA 


TCA 


1254 


TAG 


GCG 


AAT 


GGG 


TTG 


GAT 


TGT 


GAC 


AAA 


ATT 


fiamH X 

CGG ATC 


C-3' 








1291 
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FIGURE e 

DKA AND AMINO ACID SEQUENCES OF CC49 SCFV SPECIES 



Cla X BcoK I 



5'-C 


TCA 


TGT 


TTG 


ACA 


GCT 


TAT 


CAT 


CGA 


TGA 


ATT 


CCA 


TCA 


CTT 


CCC 


TCC 


46 


GTT 


CAT 


TTG 


TCC 


CCG 


GTG 


GAA 


ACG 


AGG 


TCA 


TCA 


TTT 


CCT 


TCC 


GAA 


AAA 


94 


ACG 


GTT 


GCA 


TTT 


AAA 


TCT 


TAC 


ATA 


TAT 


AAT 


ACT 


TTC 


AAA 


GAC 


TAC 


ATT 


142 


TGT 


AAG 


ATT 


TGA 


TGT 


TTG 


AGT 


CGG 


CTGT 


AAA 


GAT 


CGT 


ACG 


TAC 


CAA 


TTA 


190 


TTG 


TTT 


CGT 


GAT 


TGT 


TCA 


AGC 


CAT 


AAC 


ACT 


GTA 


GGG 


ATA 


GTG 


GAA 


AGA 


238 


GTG 


CTT 


CAT 


CTG 


GTT 


ACG 


ATC 


AAT 


CAA 


ATA 


TTC 


AAA 


CGG 


AGG 


GAG 


ACG 


286 


ATT 


TTG 


-22 
Met 
ATG 


Lys 
AAA 


Tyr 
TAC 


Leu 
CTA 


Leu 
TTG 


Pro 
CCT 


Thr 
ACG 


Ala 
GCA 


Ala 
GCC 


Ala 
GCT 


Gly 
GGA 


Leu 
TTG 


Leu 
TTA 


Leu 
TTA 


334 


Leu 
CTC 


Ala 
GCT 


Ala 

GCC 


Gin 
CAA 


Pro 
CCA 


KCO I 

Ala Met 

GCC ATG 


Ala 

GCC 


♦ 1 
Asp 

GAC 


He 
ATT 


Val 
GTG 


Met 
ATG 


Ser 
TCA 


Gin 
CAG 


Ser 
TCT 


Pro 

CCA 


382 


Ser 
TCC 


10 
Ser 
TCC 


Leu 
CTA 


Pro 
CCT 


Val 
GTG 


Ser 
TCA 


Val 
GTT 


Gly 
GGC 


Glu 
GAG 


AAG 


Val 
GTT 


20 
Thr 
ACT 


Leu 
TTG 


Ser 
AGC 


Cys 
TGC 


Lys 

AAG 


430 


Ser 

TCC 


C A ^ 

AGT 


Gin 

CAG 


AGC 


Z«eu 
CTT 


30 
Leu 
TTA 


Tyr 

TAT 


AGT 


Gly 

GGT 


Asn 

AAT 


Gla 

CAA 


AAG 


AAC 


Tyr 

TAC 


TTG 


40 

GCC 


478 


Trp 
TGG 


Tyr 
TAC 


Gin 
CAG 


Gin 
CAG 


Lys 
AAA 


Pro 
CCA 


Gly 
GGG 


Gin 
CAG 


Ser 
TCT 


50 
Pro 
CCT 


Lys 
AAA 


Leu 
CTG 


Leu 
CTG 


He 
ATT 


Tyr 
TAC 


Trp 

TGG 


526 


Ala 

GCA 


Ser 

TCC 


Ala 

GCT 


60 
Arg 

AGG 


Glu 

GAA 


Ser 
TCT 


Gly 
GGG 


Val 
GTC 


Pro 
CCT 


Asp 
GAT 


Arg 
CGC 


Phe 
TTC 


Thr 
ACA 


70 
Gly 
GGC 


Ser 
AGT 


Gly 
GGA 


574 


Ser 
TCT 


Gly 
GGG 


Thr 
ACA 


Asp 
GAT 


Phe 
TTC 


Thr 
ACT 


Leu 
CTC 


80 

Ser 
TCC 


He 
ATC 


Ser 
AGC 


Ser 
AGT 


Val 
GTG 


Lys 
AAG 


Thr 
ACT 


Glu 
GAA 


Asp 
GAC 


622 


Leu 
CTG 


90 
Ala 
GCA 


Val 
GTT 


Tyr 
TAT 


Tyr 
TAC 


Cys 
TGT 


Gin 

CAG 


Gin 

CAG 


Tyr 
TAT 


Tyr 

TAT 


Ser 

AGC 


100 
Tyr 
TAT 


Pro 

CCC 


Leu 

CTC 


Thr 

ACG 


Phe 
TTC 


670 


Gly 
GGT 


Ala 


Gly 
GGG 


Thr 


Lys 
AAG 


110 
Leu 


Val 


Leu 


Hind III 
Lvs Leu Ser 


Ala 


Asp 


Asp 


Ala 


120 
Lvs 




GCT 


ACC 


CTG 


GTG 


CTG 


AAG 


CTT 


AGT 


GCG 


GAC 


GAT 


GCG 


AAA 


718 
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Lvs Asp Ala Xla Lva Lvs Asp Asp Ala Lva Lvs Asp Aap Ala Lya Lva 

AAG GAT OCT GCG AAG AAG GAT GAC GCT AAG AAA GAC GAT GCT AAA AAG 



Xho 1 140 150 
Asp Lqu Glu Val Gin Leu Gin Gin Ser Asp Ala Glu Leu Val Lys Pro 
GAC CTC GAG GTT CAG TTG CAG CAG TCT GAC GCT GAG TTG GTG AAA OCT 

160 

Gly Ala Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr 
GGG GCT TCA GTG AAG ATT TCC TGC AAG GCT TCT GGC TAG ACC TTC ACT 

170 180 
Asp His Ala lie His Trp Val Lys Gin Asn Pro Glu Gin Gly Leu Glu 
GAC CAT GCA ATT CAC TGG GTG AAA CAG AAC CCT GAA CAG GGC CTG GAA 

190 200 
Trp lie Gly Tyr Ph© Ser Pro Gly Asn Asp Asp Ph© Lys Tyr Asn Glu 

TGG ATT GGA TAT TTT TCT CCC GGA AAT GAT GAT TTT AAA TAC AAT GAG 

210 

Arg Ph© Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr 
AGG TTC AAG GGC AAG GCC ACA CTG ACT GCA GAC AAA TCC TCC AGC ACT 

220 230 
Ala Tyr Val Gin Leu Asn Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr 
GCC TAC GTG CAG CTC AAC AGC CTG ACA TCT GAG GAT TCT GCA GTG TAT 

240 

Phe Cys Thr Arg Ser Leu Asn Met Ala Tyr Trp Gly Gin Gly Thr Ser 
TTC TGT ACA AGA TCC CTG AAT ATG GCC TAC TGG GGT CAA GGA ACC TCA 



250 

. Val Thr Val Ser 
GTC ACC GTC TCC 
250 

. Val Thr Val Ser 
GTC ACC GTC TCC 
250 

. Val Thr Val Ser 
GTC ACC GTC TCC 

Kho X 



Ser •** 
TCA TAA 

Pro Glu Asp Tyr Asp Ser *•* 
CCT GAG GAC TAT GAC TCC TAA 

260 

Pro Glu Asp Pro Glu Asp Tyr Asp 
CCT GAA GAC CCT GAA GAC TAT GAC TAA 



76£ 



614 



862 



910 



956 



1006 



1054 



1102 



1120 



1135 



1141 



AAA 


GCT 


AGC 


GAT 


GAA 


TCC 


GTC 


AAA 


ACA 


TCA 


TCT 


TAC 


ATA 


AAG 


TCA 


CTT 


1169 


GGT 


GAT 


CAA 


GCT 


CAT 


ATC 


ATT 


GTC 


CGG 


CAA 


TGG 


TGT 


GGG 


CTT 


TTT 


TTG 


123-7 


TTT 


TCT 


ATC 


TTT 


AAA 


GAT 


CAT 


GTG 


AAG 


AAA 


AAC 


GGG 


AAA 


ATC 


GGT 


CTG 


1285 


CGG 


GAA 


AGG 


ACC 


GGG 


TTT 


TTG 


TCG 


AAA 


TCA 


TAG 


GCG 


AAT 


GGG 


TTG 


GAT 


1333 



BaaiB I 

TGT GAC AAA ATT CGGATCC-3' 



1354 (A); 1369(B); 1375(C) 
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ANION EXCHANGE CHROMATOGRAPHY & SDS-PAGE OF MONO S 
POOL 2 E. COL/ pSCFV UHM 8.1 PERIPLASMIC FRACTION 




I 1 I I I I 1 I I I I I M I I I I I I I I I I I I I I 

5 10 15 20 25 

Fraction Number 
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PUVSMID CONSTRUCTIONS LEADING TO pPY31& pPY32 



BstB I 



Xho I 
EcoR I 
Sma I 
BamH I 
L ^c^Hindlll 




Digest with Sst I, 
and Sal I 



Isolate vector (5.9 Kbp) 
and insert (2.5 Kbp) 



colEI ori 



Ssll 




B$tX] 




Sac II 




Xma III 




Not I 




Xba 1 




Spel 




BamH I 




Sma 1 
Psti 


Digest with Sst I , 


EcoRI 
EcoR V 


and Sal I 


Htnd III 




Clal 




Sail 




Xho I 




Dra II 




Apa 1 




Kpn I 





Isolate vector 
(-2.8 Kbp) 



Sal I 



pHIL- SI insert 
+ 

pBluescript II vector 



T4DNA Ligase 




BstB I 
EcoR I 

BamH I 
EcoR V 



Sal I 
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Sst I 




Sal I 



scFv 



F. 



pPY21& 
pPY22 



BstB I 



Sst I 
Bglll 



Digest with Sst 1 + Sal I 
Ligate with pHIL - SI 
vector fragment from A 



Pst I 




Bgl II 



FIGURE 15 
Continued 



T 'Sph 1 
BstB I 
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FIGURE 16 



DMA AND AMINO ACID SEQUENCE OF CC49 SCFV IN pPY21 & pPY22 



AOXIP 

TTTAACGACJU^CTTGXGAAGATC 



5'-AOXl- 



BstB X 

(AAAAAACAAC) TAATTATTC 



-22 



MAf Lvs Tvr MPt Leu Len filn Ma Ph*^ Leu Ph** ^.eu Ala 

GAA ACG A?G AA^ TAC ATG CTT TTG CAA GCT TTC CCT TTC CTT TTG GCT 

r,iv Phe Ala T.v. Tl^ Ser Ala Asp lie Val Met Ser Gin Ser Pro 

GGT TTT GCA GCT AAG.^AI A-ICT GCT GAC ATT GTG ATG TCA CAG TCT CCA 

10 20 
Ser Ser Leu Pro Val Ser Val Gly Glu Lys Val Thr Leu Ser Cys Zys 
Kc lie C?A cIt GTG TCA GTT GGC GAG AAG GTT ACT TTG AGC TGC AAG 

30 ^° 
ser ser Gin Ser leu Lev Tyr Ser Gly Asn Gin Lys Asn Jyr Z^u Alj 

TCC AGT CAG AGC CTT TTA TAT AGT GGT AAT CAA AAG AAC TAC TTG ecc 

TA6VLFR2 

TGG TAC CAG CAG AAA. CCA G6 50 , , _ 

Trp Tyr Gin Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu lie Tyr Trp 

TGG TAC CAG CAG AAA CCA GGG CAG TCT CCT AAA CTG CTG ATT TAC TGG 

60 ""^ 
AJa Ser AJ« Ax-a Glu Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly 
Gct TCC GCT AGG GAA TCT GGG GTC CCT GAT CGC TTC ACA GGC AGT GGA 

80 

Ser Gly Thr Asp Phe Thr Leu Ser He Ser Ser Val Lys Thr Glu Asp 
TCT GGG ACA GAT TTC ACT CTC TCC A?C AGC AGT GTG AAG ACT GAA GAC 

TAGVLCDR3 
A T 

90 GT CAG CAG TAT TAT AGC TAT CO 

Leu Ala val Tyr Tyr Cys Gin Gin Tyr Tyr Ser Tyr Pro Leu Tbr Phe 
CTG GCA GTT TAT TAC TGT CAG CAG TAT TAT AGC TAT CCC CTC ACG TTC 

110 Hind III ^20 

Glv Ala Gly Thr Lys Leu Val Leu Lys i^u Ser Ala Asp Asp Ala 
GGT GC? GGG IcC AAG CTG GTG CTG A^.G CTT AGT GCG GAC GAT GCO AAA 

130 

i.va Aso Ali . r^^^^ Lva Lvs Asp Asp Ala T.yi. Lys Asp Asp Ala Lys Lys 
AAG GAT GCT GCG AAG AAG GAT GAC GCT AAG AAA GAC GAT GCT AAA AAO 

Xsp Glu ia? Gin Leu Gin Gin Ser Asp Ala Glu Leu Val Lys Pro 

KAC CTC GAG GTT CAG TTG CAG CAG TCT GAC GCT GAG TTG «.G AAA CCT 



57 



105 



153 



201 



249 



297 



345 



393 



441 



489 



537 
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FIGURE 16 Continued 



160 

Glv Ala Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr 
GGG GCT TCA GTG AAG ATT TCC TGC AAG GCT TCT GGC TAC ACC TTC ACT 

170 ^ 

AsD Bis Ala Ho Bis Trp Val Lys Gin Asn Pro Glu Gin Gly Leu Glu 
J^C CA? GCA AT? CAC TGG GTG AAA CAG AAC CCT GAA CAG GGC CTG GAA 

TAGVBCDR2 190 

Irl ill III I" aiy A.n Asp Asp P.e Zys Tyr Asn Glu 

III ATT GGA St TT? TCT CCC GGA AAT GAT GAT TTT AAA TAC AAT GAG 

210 

Atq PhB Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser Thr 
AGG TTC AAG GGG AAG GCC ACA CTG ACT GCA GAC AAA TCC TCC AGC ACT 

220 

Ala Tvr Val Gin Leu Asn Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr 
GCC TAC GTG CAG CTC AAC AGC CTG ACA TCC GAG GAT TCT GCA GTG TAT 

240 

Phe Cys Thr Arg Ser Leu Asn Met Ala Tyr Trp Gly Gin Gly Thr Ser 
TTC TGT ACA AGA TCC CTG AAT ATG GCC TAC TGG GGT CAA GGA ACC TCA 



585 



633 



681 



729 



777 



825 



250 

A. Val Thr Val Ser 
GTC ACC GTC TCC 

250 

B • Val Thr Val Ser 
GTC ACC GTC TCC 



260 

Pro Glu Asp Pro Glu Asp Tyr Asp *** *** 
CCT GAA GAC CCT GAA GAC TAT GAC TAA TAG 



«pyj' * * * « « * 

TAC TAA TAG 



867 



846 



BajnH 1 
GGATCCTTAG- 



-AOXl TERMINATOR-3' 877/856 
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FIGURE 23 

DNA AND AMINO ACID SEQUENCE IN pATDFLAG 



Clm 1 XcoK I 



5'-C 


TCA 


TGT 


TTG 


ACA 


GCT 


TAT 


CAT 


CGA 


TGA 


ATT 


CCA 


TCA 


CTT 


CCC 


TCC 


46 


GTT 


CAT 


TTG 


TCC 


CCG 


GTG 


GAA 


ACG 


AGG 


TCA 


TCA 


TTT 


CCT 


tec 


GAA 


AAA 


94 


ACG 


GTT 


GCA 


TTT 


AAA 


TCT 


TAC 


ATA 


TAT^ 


AAT 


ACT 


TTC 


AAA 


GAC 


TAC 


ATT 


142 




AAG 


ATT 


TGA 


TCT 


TTG 


AGT 


CGG 












i AVv 


PA A 


TTA 




TTG 


TTT 


CGT 


GAT 


TGT 


TCA 


AGC 


CAT 


AAC 


ACT 


GTA 


GGG 


ATA 


GTG 


GAA 


AGA 


238 


GTG 


CTT 


CAT 


CTG 


GTT 


ACG 


ATC 


AAT 


CAA 


ATA 


TTC 


AAA 


CGG 


AGG 


GAG 


ACG 


266 


Al I 


1 i U 


-22 
Met 
ATG 


Lys 
AAA 


Tyr 


Leu 
CTA 


Leu 
TTG 


Pro 
CCT 


Thr 


Ala 
GCA 


Ala 
GCC 


Ala 
GCT 


Gly 
GGA 


Leu 
TTG 


Leu 
TTA 


Leu 
TTA 




Leu 
CTC 


Ala 

GCT 


Ala 

GCC 


Gin 
CAA 


Pro 
CCA 


Hco Z 
Ala Met 

GCC ATG 


Ala 

GCC 


41 
Asp 

GAC 


He 
ATC 


Val 
GTG 


Met 
ATG 


Thr 
ACC 


Gin 
CAG 


Ser 
TCT 


Pro 
CCA 


382 


Asp 
GAC 


10 
Ser 
TCC 


Leu 
CTG 


Ala 
GCT 


Val 
GTG 


Ser 
TCT 


Leu 
CTG 


Gly 
GGC 


Glu 
GAG 


Arg 
AGG 


Ala 
GCC 


20 
Thr 
ACC 


He 
ATC 


Asn 
AAC 


Cys 
TGC 


Lys 

AAG 


430 


Ser 

TCC 


Ser 

AGC 


Gin 

CAG 


Ser 

AGT 


Val 

GTT 


30 
Z<eu 

TTA 


Tyr Ser 

TAC AGC 


Ser 

TCC 


Asn 

AAC 


Asn 

AAT 


AAG 


Asn 

AAC 


Tyr 

TAC 


l#eu 
TTA 


Ala 

GCT 


478 


Trp 

TGG 


Tyr 
TAC 


Gin 
CAG 


Gin 
CAG 


Lys 
AAA 


Pro 
CCA 


Gly 
GGA 


Gin 
CAG 


Pro 
CCT 


50 
Pro 
CCT 


Lys 

AAG 


Leu 
CTG 


Leu 
CTC 


He 
ATT 


Tyr 
TAC 


Trp 

TGG 


52 6 


Ala 

GCA 


Ser 

TCT 


60 

Thr Ax9 

ACC CGG 


Glu 

GAA 


Ser 

TCC 


Gly 
GGG 


Val 
GTC 


Pro 
CCT 


Asp 
GAC 


Arg 
CGA 


Phe 
TTC 


Ser 
AGT 


70 
Gly 
GGC 


Ser 
AGC 


Gly 
GGG 


574 


Ser 
TCT 


Gly 
GGG 


Thr 
ACA 


Asp 
GAT 


Phe 
TTC 


Thr 
ACT 


Leu 
CTC 


BO 
Thr 
ACC 


He 
ATC 


Ser 
AGC 


Ser 
AGC 


Leu 
CTG 


Gin 
CAG 


Ala 
GCT 


Glu 
GAA 


Asp 
GAT 


622 


Ala 

GCG 


90 
Ala 
GCA 


Val 
GTT 


Tyr 
TAT 


Tyr 
TAC 


Cys 
TGT 


Gin 

CAG 


Gin 
CAA 


Tyr 

TAT 


Tyr 

TAT 


Ser 
AGT 


100 
Tyr 
TAT 


Pro 

CCT 


Leu 

CTC 


Thr 

ACT 


Phe 
TTC 


670 


Gly 


Gly 


Gly 


Thr 


Lys 


110 

Val 


Val 


He 


Hind III 
Lvs Leu Ser 


Ala 


Asp Asp 


Ala 


120 
Lys 




GGC 


GGA 


GGG 


ACC 


AAG 


GTG 


GTG 


ATC 


AAG 


CTT 


AGT 


GCG 


GAC 


GAT 


GCG 


AAA 


718 
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FIGURE 23 Continued 



130 

Lvs Asp Ala Ala Lvs Lvs Asp Asp Ala Lvs Lvs Asp Asp Ala Lvs I^va 

AAG GAT GCT GCG AAG AAG GAT GAC GCT AAG AAA GAC GAT 6CT AAA AAG 76€ 

Xho I NhB Z 

Asp reu Glu Ala Ser Asp Tyr Ly9 Asp Asp Asp Asp Lys *** 

GAC CTC GAG ACAATGTC GCT AGC GAC TAC AAG GAC GAT GAT GAC AAA TAA 81 € 

AAA OCT AGC GAT GAA TCC GTC AAA ACA TCA TCT TAC A TA A AG TCA CTT 86< 
PENPTSEQ: <* TTT TGT XCT AGA ATG TAT TTC 

GGT GAT CAA GCT CAT ATC ATT GTC CGG CAA TGG TGT GGG CTT TTT TTG 912 

TTT TCT ATC TTT AAA GAT CAT GTG AAG AAA AAC GGG AAA ATC GGT CTG 96C 

CGG GAA AGG ACC GGG TTT TTG TCG AAA TCA TAG GCG AAT GGG TTG GAT 1008 

BaxnR X 

TGT GAC AAA ATT CGGATCC-3' 102"? 



32/38 



BNSDOCID: <WO 940469 1A2 I > 



wo 94/04691 



PCT/US93/07918 



FIGURE 24 

DNA AND AMINO ACID SEQUENCE OF pSC4 9FLAG 

ClM Z KCOR Z 

5'-C TCA TGT TTG' ACA GCT TAT C AT CGA TGA ATT C CA TCA CTT CCC TCC 46 

GTT CAT TTG TCC CCG GTG GAA ACG AGG TCA TCA TTT CCT TCC GAA AAA 94 

ACG GTT GCA TTT AAA TCT TAC ATA TAT AAT ACT TTC AAA GAC TAG ATT 142 

TGT AAG ATT TGA TGT TTG AGT CGG CTG AAA GAT CGT ACG TAC CAA TTA 190 

PENPRl: AAC ACT GTA GGG ATA GTG GAA -> 

TTG TTT CGT GAT TGT TCA AGC CAT AAC ACT GTA GGG ATA GTG GAA AGA 238 

GTG CTT CAT CTG GTT ACG ATC AAT CAA ATA TTC AAA CGG AGG GAG ACG 28 6 

PEKPR2: <- TAT AAC TTT GCC TCC CTC TG 

-22 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu 
ATT TTG ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA 334 

Kco Z +1 

Leu Ala Ala Gin Pro Ala Met Ala Asp He Val Met Thr Gin Ser Pro 

CTC GCT GCC CAA CCA G CC ATG G CC GAC ATC GTG ATG ACC CAG TCT CCA 382 

10 20 
Asp Ser Leu Ala Val Ser Leu Gly Glu Arg Ala Thr He Asn Cys Lys 
GAC TCC CTG GCT GTG TCT CTG GGC GAG AGG GCC ACC ATC AAC TGC AAG 4 30 

30 40 
Pro Ser Gin Ser Val Leu Tyr Ser Ser Asn Asn Lys Asn Tyr Leu Ala 

CCC AGC CAG AGT GTT TTA TAC AGC TCC AAC AAT AAG AAC TAC TTA GCT 478 

TAGVLFR2: 

TGG TAC CAG CAG AAA CCA GG 50 

Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Leu Leu He Tyr Trp 

TGG TAC CAG CAG AAA CCA GGA CAG CCT CCT AAG CTG CTC ATT TAC TGG 52 6 

60 - -^O 
Ala Ser Thr Krg Glu Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly 

GCA TCT ACC CGG GAA TCC GGG GTC CCT GAC CGA TTC AGT GGC AGC GGG 574 

80 

Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Ala Glu Asp 

TCT GGG ACA GAT TTC ACT CTC ACC ATC AGC AGC CTG CAG GCT GAA GAT 622 

90 100 
Ala Ala Val Tyr Tyr Cys Gin Gin Tyr Tyr Ser Tyr Pro Leu Thr Phe 
GCG GCA GTT TAT TAC TGT CAG CAA TAT TAT AGT TAT CCT CTC ACT TTC 670 

110 Hind III 120 

Gly Gly Gly Thr Lys Val Val He Lys Leu Ser Ala Asp Asp Ala Lys 
GGC GGA GGG ACC AAG GTG GTG ATC AAG CTT AGT GCG GAC GAT GCG AAA 718 
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FIGURE 24 Continued 



LV8 


GAT 
Asp 


GCT 
Ala 


OHVHSEQ: 
GGG AAG AAG 
Ala Lvs Lvs 


GAT 
Asp 


OAC 
Asp 


G 

Ala 


130 
Lvs 


Lvs 


Asp 


Asp 


Ala Lvs 


Lvs 


766 


AAG 


GAT 


GCT 


GCG 


AAG 


AAG 


GAT 


GAC 


GCT 


AAG 


AAA 


GAC 


GAT 


GCT A^ 


AAG 


Asp 

GAC 


XhO 
Z«eu 

CTC 


X 

Glu 
GAG 


140 

Val 
GTT 


Gin 
CAG 


Leu 
TTG 


Gin 
CAG 


Gin 
CAG 


Ser 
TCT 


Asp 
GAC 


Ala 
GCT 


Glu 
GAG 


Leu 
TTG 


150 

Val Lys 
GTG AAA 


Pro 
CCT 


814 


Gly 
GGG 


Ala 

GCT 


Ser 
TCA 


Val 
GTG 


Lys 
AAG 


He 
ATT 


Ser 

TCC 


160 

Cys 
TGC 


Lys 
AAG 


;Cla 
GCT 


Ser 
TCT 


Gly 

GGC 


Tyr 
TAC 


Thr Pne 
ACC TTC 


Thr 
ACT 


862 


Asp 

GAC 


170 
Bis 
CAT 


Ala 

GCA 


He 

ATT 


Bis 

CAC 


Trp 
TGG 


Val 
GTG 


Lys 
AAA 


Gin 
CAG 


Asn 
AAC 


Pro 
CCT 


180 
Glu 
GAA 


TAGVHCDR2: A 
Gin Gly Leu Glu 
CAG GGC CTG GAA 


910 


TGG 
Trp 
TGG 


ATT 

lie 
ATT 


GGA 
Gly 
GGA 


TAT 
Tyr 

TAT 


TTT 
Pbe 

TTT 


TCT 
Ser 
TCT 


CC 

Pro Gly 

CCC GGA 


Asn Asp Asp 

AAT GAT GAT 


Phe 

TTT 


Lys 
AAA 


Tyr Asn 

TAC AAT 


200 
Glu 
GAG 


956 


Arg 

AGG 


Phe 

TTC 


I«ys 
AAG 


Gly 

GGC 


Lys 
AAG 


Ala 
GCC 


Thr 
ACA 


Leu 
CTG 


Thr 
ACT 


210 
Ala 
GCA 


Asp 
GAC 


Lys 
AAA 


Ser 
TCC 


Ser Ser 
TCC AGC 


Thr 
ACT 


1006 


Ala 
GCC 


Tyr 
TAG 


Val 
GTG 


220 
Gin 


Leu 
CTC 


Asn 
AAC 


Ser 
AGC 


Leu 
CTG 


Thr 
ACA 


Ser 
TCT 


Glu 
GAG 


Asp 
GAT 


Ser 
TCT 


230 

Ala Val 
GCA GTG 


Tyr 
TAT 


1054 


Pne 
TTC 


Cys 
TGT 


Thr 
ACA 


Arg 
AGA 


Ser 

TCC 


Leu 

CTG 


TAGVBrR4 
ASD Met Ala 

AAT ATG GCC 


: AC 
Tyr 

TAC 


TGG 

Trp 
TGG 


GGT 
Gly 

GGT 


CAA 

Gin 
CAA 


GGA ACC 

Gly Thr 
GGA ACC 


TCA 

Ser 
TCA 


1102 


G -a 

Val 
GTC 


> 250 
Thr Val 
ACC GTC 


Ser 
TCC 


Ser 
TCA 


Khe X 

Ala Ser 
GCT AGC 


Asp 

GAC 


260 

Tyr Lys Asp Asp Asp Asp Lys 

TAC AAG GAC GAT GAT GAC AAA 


* * * 

TAA 


1150 


AAA 


OCT 


AGO 


GAT 


GAA 


TCC 


GTC 


AAA 


ACA 


TCA 


TCT 


TAC 


ATA 


AAG TCA 


CTT 


1198 


GGT 


GAT 


CAA 


GCT 


CAT 


ATC 


ATT 


GTC 


CGG 


CAA 


TGG 


TGT 


GGG 


CTT TTT 


TTG 


1246 


TTT 


TCT 


ATC 


TTT 


AAA 


GAT 


CAT 


GTG 


AAG 


AAA 


AAC 


GGG 


AAA 


ATC GGT 


CTG 


1294 


CGG 


GAA 


AGG 


ACC 


GGG 


TTT 


TTG 


TCG 


AAA 


TCA 


TAG 


GCG 


AAT 


GGG TTG 


GAT 


1342 


TGT 


GAC 


AAA 


ATT 


BaaR X 
CGG ATC 


C-3 


















1361 
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